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COMPOS* i IONS <LNfD MP nW>f>S Ff *R I ITIi THFRAPY ANT* l>\ AviK? >SIS ol 
PROS I AC* CANaft 

5 3 ?t 4 HN 10 A £ Hi .DfiHi it 1N\ LK I ION 

IK* pjfc&nt unem^oa ^jctdSJv tit ^KtHpv <n^ i *** 

u>t\kr oUvh *s ppA^te vAiu<,r 111? imotiUeu ^ time fimlK i?\K«ji to 

puhrudi ttf Jo s.ru\h*U fv\ pfjtfsilt Sihii pohpepittitA ^J jyhna<.l:;tiKle\ 

JtU kviF^l MJOV U** IWINUON 

^.wot is. ^ ii^iikji 1 1 e ^ & pro >k*ti irssoaghi «t \si. kvi Id A ^hat. 
Cmu is a jigiiLk^s it vNr-fctli pr>bkir tbroy&bout the wmltf \kh *u*i <kiujKk-> Ktt<? 
IS ^cir= <n *n litrttVn^ at J theop\ >sj t - m <w si< <x oiH* wi^ *^iy ^yc^aM 

pftxu njsli n M\ut> [uticiVi. 

Vv^e < itl^t is m> .omrw fWm ot t^nca v^**g nusV %%*h 

ouueace sk -vs that Huffititi iWitaV 4,dii(Lt;*' Lt^ the pr *ptiw*\ tu l^tiALtasi-e ti bune, 
utiid the dt^L^L appears to proiijvss uu-vitablv frmi rn^kv^n dcpcs^nt Jo ^drotien 
fcttsKtory <*trixi* kvsdiog lu mo^ased p^?ui 1 fMi> ^l^^tU ds.Cd^ is 

' utrently the ^\4*ts>I k*bj# ^usp ot cancel *kvslh a;ocs^ s m (be T ' y 

i.ji\*;r Ttuw:tv- <i*tVknH CnwaJ C^m^K^h, incite icnt Ig^ii e^ft ^ur^ry <tfh$tw 

3hwm*\^ % .1^??fiekf pro*Wl? spec?tk pro^sns - pn^&t^Se s|mcific ^jfegers M^A) 

i 



3B& prostatic *cfci phosphatase fPAP) * ha« lim*<ri therapeutic *c.d d^stk 
^.tenl^ For cxrfupfr. PSA lcvtfi do ao* ^wy, wrrttwe ^11 *uh Owr fr^e of 
F ^Ule cmwt-r, being pusitWe m » p^n^as oi *w>-pu>^ um,or ^s, HH-fodisg 
bwaign pr.ofcitic by W fcsia *BPH>- Furthermore, sK^JWtncisls conebte with 
5 prostate vulume. and Oo aot usJscaie Ac tevet of raetastasis. 

In snite of t wearable research sntr? ihwajfues far ftese awl other 
cancers, pwsaw c&icef rrajaira diSjceii to djwsnase sad tot **Mvdy. Accordingly, 
thtsra is * t«*d m the art fw improved methods tl* <fcs«*tf »g and totaiiflft such ^xr&. 
The present ia^nibn foliiUs iKtfis and further pwvides c*er «tated advantages 

1 0 SUMMARY OF THE TKV ENTION 

v, <ne *tvU tk pit \&>n\^m ^io^k> t^H^Utot^te 

x4t ^iftwu'td n MOW No Ml SIM-'' l'M7^ 

15 s ^ ^0 W ^ r ^ >V V)1 o*8-h2<' M0,ertl 

D ^ f 4 o 6 .^^ ^^bl "Ml 7^,- ■ 7-.— ^64 

^t>f t^mr»»tK^ <>* ft* ^pra^es asxWed *n ^FQ B> V> 1-11 1 
l T \ I * 17* P> P<? W ^ OK ^0, VP US ^\ ^81 

tm ^ rr ""7^ ^ ?^ 7&* t 

^e^ F ^ili *siou>no i n\ iimm n > n~ » w->in 

2S r> f> ^ s u^^v -u&v^ ^1 t ^:and ^4 4^^^^^ ^1 iH >»s 
">r '1 TJto * < -' T 2L 7"^ n".^ 7^1 75S^^5 " ^-7?^ «a4 ^ ^ t 
1-111 ,|Mn,UM^ 79-iO' 107*1\ m "V-V^ >4^v 
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m. m mA 384-476, 524, 526 ; 530, SSL 533, 335, S36, 352, 36^572, 587, 59!., 593- 
606, 619-626, 630, 631, 634, 636, 639-655, 674,686, 6Ri, ?U, 713, 716, 720-722, 735, 
737-739, 75 L 753, 764, 765, 773-776 sad 786-7F8, under moderrtdy ssraigoit 
ceadttiom; 

S {ts} s&queijeei hsvsji* a? leavs 75% identity to a sequence of SfvQ ID 

NO- MIL 115-171, 173*175. 177. 179405, 307-315, 326, 328, 330, 332*335, 340- 
375, 381, 382 m& 384476. 524, 526, 550, 531, 533, 535, 536, 552, 569-572. 587. 591, 
593-606, 618-636, 630, 631, 654, 636, 639-655, 674, 680, 6*1, 711. 713, 716, 720-722, 
735, 737-739, 751, 753, 764, 765, 773-776 an*! 786-788; 

10 ft) sequences having &* 91}% sdemity ii* a !*eque5>e« r<f SEQ ll> 

MO: 1-1 II, 115-17], 173-175, 177, 179-305, 307-315, 326, ,128, 330, U2-335, 340- 
375, 381, 382 and 384-476, 524, 526, 530, 531, 533, 535, 536., 552, 569*572* 537, 591, 
593-606, 618-626, 630, 63 § , 634, 636, 639-655, 674, 680, 681, 71 1, 713, 716., 720-722, 
735, 737-739, 751, 75*, 764, 765, 773-776 and 7S6-7SS; and 

IS (g) degemr&te variants of a sequfcrsee provided m S&Q ID NO, 1- 

i U, 1 15-171, 173075, 17?, 17S-305, 307-315, 326, 321 330, 332-335, 340-375, 381, 
3S2 and 384-476, 524, 526, S30, 531, S33, 535, 536, 55i, 569-572, 587, $91, 593-606, 
618-626, 630, 631,634, 636, 639-655, 674, 680, 681, ?U, 713, 716, 720-722, 735, 717- 
739, 751, 753, 764, 763, 773*776 arid ?»&*?« 8. 

20 In one preferred emk^iraent, dre polynucleotide compositions of the 

invasion are c\press$d in least about 2.0%, roor« prdferabfy $n at kstst sho«f 30%, 
arsd masi preferably it's at ie&st aboitf 50% of private fj^rue samples letted, at a level 
tbat is st Icasr about 2-fold, preferably at i^t afanut 5-iokl and ms>sr preferably m tesst 
■3 boi.it 10- fold higher than that lor other iKJrmsj Uj^uts. 

25 Th? pmsrot mvtmr$ots, in mother- Aj.pvct, provkks polypepdde 

TO?npo,S5tjo^ oojijprisirig «k smmo acid sttpisisee that is encoded by a polynucleotide 

The prevent mvaunsn further pswkle^ po!} peptide ^rsipnsuion;; 
tosnpn^bg an inihio add .w^u^tie selected from the group emu^stifig of sequci*c«s 
3ft icthed in SHQ ID NO; 1 12-1 H 172, 176, 178, 327, 329 331, 336, 339, 376-380, 383, 



iT-~M\ 4^. %4, >rt\ s20, ^?2 52 v ^ <\ ^ 7 -:»l, s ^ *7^86- 

7ir, 7* 7-7iy, "^-m ^6, H0-"SO, "Vt 7*4, 7^>, 7^6-772. 7 -- T S5 a&d "S-*-?**! 
In tert^n pjuenea enK»djncnK the fnl\peptKk*' 
5 po>uiiOt&«nd^ s-t <ix. iwes-nt tt»«it^n **t Kirwwh'gtrac, < tlwy u^ah^ of 
eikrttni in irowonc av^oa^ *artw,uhrl\ a humoral ^d/or^UoL^ uithusi* 
ss tilths daubed hcRns 

The prswrm Jfttntum fiasbw proves *raf»ftn.tiK \ms&^ and/or 
J«»vams* ot the d.*cU*wd xrtwpuut awit* H^ 1 * 0 *'* 5 * ^'^^ 
10 f t YtiKfif , vm^ti iiNt dcmxii^vM r r***r*h a fc\ si of mmxu^wc at*t«t> 
at fcatf afrvit m>v ii-'fcf ,Hv at It ^ *h*it j«tfmN\ ,a k^t ?hou* 

of the ie\ol ot miasuvK^SPaw ad^v ot j poi>pei*dt ^que^s sci Sorth JnShg U) 
NO I \14 1 4 1 72, 1 1 7K 327 3^, 33 1 . .00 * 3*V^U, 383* 4 ? ^ 504, 

505 510, S2i), 52 \ ^s, S±\ "4. ?r-^51 533^68 ^St»*, SSS 5*U 62 1 
15 ^ 6 12 633, *>5, *37, ^ too bU, 675, bSX M*4, IKK ,N>, "M, 7iS "I? 

7M, ^ ! <j 74^ 'M, 7>t x j ??, ^7"? m ?8<J ot 3 pohpq^k 

ri K?. 1"<J 3rt\ VK UK ^2 335, i4lVi?? ( ^1, "^7 <iul 

?W-*7*> 5?4, 5-^^ 5 " >i >J>: "^ 5«j***?T?, 5b7 591 ^?-<w1b, 6^b-626, 

2i 630, 631,634, 636, 639-^55, 674, ^ ?13. 716, 720-722, ?35,737-?3<K 751, 

'753, 764, 765, 773-776 and 786-788. 

The fKesem invenijon turner provuk^ polyTOide«)U(ks that encode a 
polypeptide Above, cxpws^m %fclor^ awtprismg swih r^bnackoUdcs *jd 

bost ce^s transforai*!^ or transSefted with s«ch exp^sskvjt vectors. 
2 3 Within oter aspects, the presort mvemtiofi provides jdiarraaceutka! 

comfiosiUons cMinpn^ a pojypepticl* or polyn«v?eotidt as teHkd abow and a 
physiologically ac«cp&Ws earner, 

VLMtltin a rcl«ed aspect <.■! the |)r«senit inventicm, phansuwfliticfil 
corajwsUtHis. t?j?, vkIk cowposrioiuk are f4^ v ^ fof P^phjt^ t>r U»«M>ea^ 
30 applk&fciis. Such cfimrwwitioas eenciallv cotnprif* s*a ittim\mogenic polypepiide w 



polynucleotide of the invention and an nnmuntodmnlant, such as an adjuvant,. together 
wuh a physiologically acceptable carrier. 

"The present invention further provides pharmaceutical compositions that 
comprise art amtbody or antigen-binding fragment thereof that specifically bind;* to 
5 a polypeptide uf tht present i^eistiiR, or a fr&gsneiss thereof; and fbj a phystnH^jcally 
acceptable earner. 

Within tusth-e? aspect?, she prt^ent invention provides pharmtfrcc-^texy 
compositions comprising: ia) an ati%en ptes-entm^ cell that expr^s^cs a polypeptide as 
described above and ib) a pharmaceaikiilly acceptable carrier or t?xeipient. illustrative 
10 asugcr* presenting cdts melutk dendritic cells, m&ctoph&ges. monocytes, fibroblasts 
md B cells. 

Within related aspects, pharmaceutics*! emnpOMiions are provided that 
comprise: (a* an tmtiges! presenting eel! that expresses a polypeptide as described above 
aad lb} ?»n immunostimnlajnt 

1 5 The present uwemiu^ farther prmitk,\ m other aspects, fkskm prosein-s 

that toniprise at least one polypeptide as- described above as well as polvui*fleotjdes 
encoding such fission pro-terns, typically m the form of pharmaceutical compositk-i-aa* 
& g , vaceisje coriiprssisioa^ comprising physiologically acceptable carrier artd'or an 
irmftunostMriUlant. The fusie>sis proteins may emprise rxusMspk irttraunogeaie 

20 polypeptides, or pomo&s/vamets thereof, as describe*! herein, arsd may further comprise 
one or more polypeptide se-gmeats for facilitating an&mr enhancing she expression, 
purification; arid/or immvamgaaoiiy of the polypeptides). 

Witbm blither aspeets tine present invention, provide methods for 

75 patient eoiRpnsij^i athmsirtc-msg j ph^inice^ik^S e^mposkmn ik^vnbed her^m The 
patient may be aJUicfed wtt.li prostate cancer, in which case tine methods; provide 
beafment fortks dis£-aM> m a p&ki&L eossiderM to lie M risk for saeh a disease ns»y fee 

Within farther aspects, the present hjvrnriosi provides methods fur 
Mi uihihbrtg ibr lievel^pwn? uf » uitjscj iu a p^oent, an&prhmg administering m $ 



wo mm$m 



mtmt i **ainw* virtu* om|w.jts« v akifuL^ v *>k ticnt K, a*tlu tcsJ 
v t 5 i i i^u* ..u> n , «£t wliK §1 us * mt *h *i rr\* Mt ixaLTKru to* tK ih\ or & 

IV ftwr 4 iio u tion * io"tKr sxa\ ^ ^ v>stfc m ^tV t d puts Tm 'ifk^s 
5 rv re *clb tot* a hhMouoI «*mp e jrpn-wg vckick ^ a ^ »iog*ctl 

varp't vr^ t tMt <fjWkah ^tb a oohper^ ot ihe mention 
w'isrur SiK 5>te^ ccrttknng v prt" r?*ed. unck ctncHrh * >r t<-<iv M*ff -i~nt to 

x-ms* she - ati^\ si of \fh t ^pi^Njn^ tto p\% ptplK*e tn s (F t it «m 

itl ttf J ^wv- aetV^ v\ M M liaiihiUBv. tfr* 
10 <Ji vtli t ^ v^r ^> -i i p ,o i£ <\rm t^ru< % ^ * patent n Holc^n-t! 

MethA^ «e UmVi prouvi 1 wum t Kr tm Ht£mA«rr„ 

^wop-nina 1 cvtSvpecLlc for ipx tthtjpn.^t«weTt^cuiB^NWg 
at 1 csJL^ with no o- wr*- it trt <» ^ ^re^dSo as vksu^J aho\c, ^ i 
J 5 fv3vPLkko<Kie cicotfi^g 4 pohpkpfctfe 'in «ta ^.ntt^en pre cn'inc tell ftat 
-^prc^ve*. \kh i> p> \\pvjtid^ ufAt option <wk tar 1 ti^ m'^kviI to |*rm'f the 

W^Hd mriK>s tht pivtnt i^^ntm ifieL^s to^ 

k,eJ'^m-nf ot - v&. kci m ^ p it t t bing the ^tjf»«- ot x »\ uKabaMiE CD4* 

25 ^ifit^UK ** nri"t in 'nttu" utiiK p»^um i«t ^<KpL^A .Kttovx! Krot\ ^t) a 
vj^m^k^^k .vtodtOfc ^*-h «i oulvixp^s^ aiki taut tiUj^tai p^a'mg ul thU 
v\ptx ^ xu^b 1 pal r^tssi. ^^m^ in^ ta tb" ^^it art ^litLkstf woua* 

of t% f*ol &r«w- 1 ^IK thcTchv ur^ t>t v ^ rte^Jot 0^ it j var^tr m tu». pit«R* 
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Within tUrthej aspects, the present invention provides methods for 
deteswiimng the presence or absence of a cancer, preferably 4 proft&i-i cancer, in a 
patient comprising: {a) contacting a biological sample obtained from a patserst with a. 
bm-ding agent that binds to a polypeptide as recited above; (b) detecting in the sample an 
S amount of polypeptide? that binds to the binding agent; and (cji comparing the amount of 
polypeptide with a predetermined cutoff value, and Jhere&nra determining the psxssencs 
or absence of a e&neer in the patient. Within preferred etRb^imenis,, the binding agent 
is antibody, more preferably * rrKmodtinai anybody. 

lite pn^ni ttivtriiticn al$o presides, within nthar aspects, methods for 

10 ssonitorkg the progression of a cancer m a patient Such methods comprise the steps 
of (a) toniaciing a biological sample obtained from 3 patient at a first ptunt in time 
with a finding agent that binds to a poly* eptitfc as recked above, (b) detecting in the 
sample an ajnoajjt of polypeptide that binds to the l^nting agent; Ce) repeating steps- fa) 
and *b) a biologies! sample obtained irum die pattost 3? a Mi^stqucm point in 

.IS time; and {d) comparing the amo-unl of polypeptide detected in step te> with the amount 
defected in .step (b) and therefore monitoring the progression of the cancer in she 
psstieait 

The present Invention further provides, 'within other aspcels, methods for 
determining the presence qt absesnee of a cancer in a patient comprising the S>teps of: (a) 

20 competing a biological sample obtained from d parent with an obgasmtlemide that 
h>hndt^S5s ft* a polynucleotide oi tire present m\e»»ue, t^j dettotieg in die sample a 
level of a polynucleotide, preferably rnjRNA„ that hybridizes- to die oligoRudeolide; and 
fe) comparing the level of polynucleotide tfw hybridises to the oligonucleotide with a 
prakininined cut-oft* value, and thercfmrn determining she n^esice nt absence of a 

25 uaneer in the patient Within certain embodiments, the amrnmt u( naRNA is detected 
via jtolvnivTjt ^ t.h^n rieuO-n^ usmg. toi tvamrde, at iw one o-Ugotusek** tide pus-net 
that hybridises to a polynudeotiJe of the present invention, nt a complement of mcft. a. 
poIynt^leohJe Within othtsr embodiments the amount of mJIMA i dekcivd u.Mng a 
hybridation technic, employing an «digOfUieJeutnie piobe tht%t lr> bridles to an 

3# inventive polymer* Jtkle, or a eompkmeM of such $ pulyuudeotidc 
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1» misted asp^is, jnaftodb to prmtJed foa r^o^tou? sx the poa^^n 
>t a taiMxi >n u j^Kort, comirsmn dw step** ta> cc»n:t*vn£ * clerical Nintple 
«h\u>jd aom a patita* vutfe sir ^ligCTaeteoiiiJe that hybi]d:«» w a potvEmlwtida <>t 
fir \«rt*tnt ^irottniK jhj Uriecnt £ to tlw MarnJt *r h>wk wit «t .1 fv>ivnuckviide tart 
5 avlxtdires to- U*c ><ipvuekMiK ,0 ,irps te) ^ tM aiinc, 1 hnh^^ 

Tampk »>htMia<sJ 1*vwb tho pjivm a \ut«tquttit jv*nt m and uft pamR i3k 
pr>htiudco&J«> Jot>xtrd *oK\jm >hr mwiitt t U><vlu* ir iH ^?iJ 

thetetr^ oio^Kitm" Jx* p& ^osst-nt ^ tits cuwr u> the |uti <nt 

Wubut iuriber b,nt'civ psv^tit w a^ori ^mule\ amalvsJ^ ^udt a? 

diawsostK kifc corormiffc such sntit-'^dies lJiaj>oos*K wNrMtsm* ore nr tnoft 
c^iaonuckotw'e jrubes gjt §>niw* as feenbeJ abuv~ are «tl<o piovidci 

ihew &t*l otJkt &>f\4-is the presacrt mv^ti-n will btuin^ jppajteiai 
djv» tCtaeace to tk: fe^k^ng xktaikd tse^npaon ^raJ .Mudi^ <feattn;s 
IS 1 stereo cs Jiwi * <4 fteicn are hefrb* taeorpotated ^ retcrtase J* tbeir cflbifctv if 
each w> uiCviporatci-i m<!?\ fchiallv 

jiKihi 1 »fm Rin ion 03- i nt? dra^im^ am* M\»ri m j> k*&\' i 

I lguio I diu'totev iht ahi m ^" F u^h 5m tod tfbtuhh^ v^p*t*sw£ the 

U|>ui\^ ?\ &<tfc th« ahhtv 01 t cdi* to TW<vr^ f e otjl^s 

numKt of ->-Jitetfet\ip spote ^ shuvn io-r &t*ewrt manbers ot m-ponQKr In ttguw 
74, JLaia is* presented lot fib^^s puJsed v*th F N S-i: jwrtiA. owpaF<rd w 
25 *»Hobi^sts ^ * •» twutwl F"?5 repack a r*guw IS, dsn is printed 

?^br-^]^ txp:c«u*> e P^O^S as eo-npated to tilwoMa-ts txftessipg B^R ? neu. 

Figwc ^ reprtvaws d peptide wmpetstsor' Wirar a>^\ shoftjj^ that rte 
Pis - jrj pepKP, d«rave^ fcnt P5t>13, bird^ HI -\-A^ Ptpttde PIS^O *ah:bii5 HIA- 
^tr^ckxt frestytttjon i*t AnM^S prrtulfc t« CTL clone in TNF rekase 



WO &i£l*S3 



9 

bibass&y,. DISiMSS CTL is sp&dfk fm thus BLA-A2 binding kiftaeiiza matrix peptide 

Figure 4 illustrates the ability of T eel] lines generated from PIS?? 10 
immuni^d mice k> specitk&lly lyse FiS#IO-puhed Jusrkat A3 Kb targm ami P501S- 
5 iKuaAJuceti feriuH A2Kb target, as compared to BGFP-tjarjsdu^eti JurlcM A2Kb. The 
percent 3vsis s$ shtrtvn as a sarses. at efftrctut to target ratios, ai» mdk.Mt:d. 

h^ore 5 iliu-i^tt- it* t-t j T ceti uTo^e + o rtcc&.iu't m& 

specdkalty iyse Jurkat A2Kb cells expressing, the representative prostate^specific 
polypeptide PSOIS, thereby demonstrating that the PlS#tO peptide may be a naturally 
1 0 prrscess^d epiitupe of the P50 1 S polypeptide, 

F^ure ^fKl t>B *re graphs illuHrsjurg the ^ptaRiry oi » CT>$ 
Hue {3A-1) for a rept^ssejitatsve p<wtate>-speci& antigen (f50iS> Figure 6A slwvft the 
resuks of a ^Cr releaoe assaj. the perceai specific l>sis shown as it ?eries of 
cffrclortaxgei snaths* as iradioated. Figure 6B shows Shr prWuctiora of mrsrfean- 
15 g&mma by JA-E cells stimulated with autologous B-LCt trars&Jueed with PSOIS, at 
varying effeetos target rations as uuilcaissd. 

Figure ? is a WVstera blot showing the expression of P5$1S in 

bacyavsroSv 

Figure # illustrates the results of epitope sftappfog sn±dje« on VSOl S, 
20 Fijzuie 9 is a schemadc tspreseHtadt-a of the PSOI'S protein showing the 

IcjcaSkm of teansmenjbrane' dbsnaiss ami predicted mtacelhifar and extras llulsr 
ikatmns. 

Figure 3 0 is ^ getiotfde ^Hciwhig lite !«&ri*iri of the prox^te get>e:s 
P775P, P704V, Bj05D ( F7I1P awi P774P vwihip the Cat Eye Syndrotoe repots at 
2S chswne 22*|1 Li 

Figure 11 shown the results of an ELl&A assay to determine ihe 
specificity of rabbit rvlyclnrud ajitbera raised gainst P50IS 

IDNt > 1 t& the: J)C«cnwia®d cONA ^westye Aw FI-13 
SFQ \D *»\> ? is &c vUerarijaed ,V cDNA •w^eks tor VI- U 
30 SFQ H> Ntj % is thz d^enauitxi ^ " rDNA ^^s^« for VI -U 



to 

SHQ il) NO 4 i*v toe- u^MS-nwv ct .T i W< \ s^qneno? to-* PM<5 
M:<Q JD Ko ^ i<v d^ertru3*.d Ui'iA ^uwi^ Co-r HI-! 

Q ID f*0 6 is. fie Lkiemssa^i 755 <J>KA ^ur&u? tor HI-** 
^4 0 1H NO 7 is f*ie iktemu til ^ <J »N v. ^qycrtLe tor IH-4 
bFQ ID N'O $ n Uic sivts-si^irnnl 3* d'K \ w^&lO t^r J 147 
M 0 CD NO 0 n t^x Avi ju-i^d 5" iD\ \ v^uuiko for J 1-4 7 
^fO 10 N'^ 1 ^ i* 'He JHeatKiKU ? k F'NA sc^kt^c tor II ^1 2 
SfQ 10 N ffc 11 U v i 5 tiK JetemNB^ v tl'NA veqyenco &f 11 II 
SLQIOV' 4 l?iMh? k^mwevl 1 UAA s^uen^ v*Nl 1^*1 
^L\t ID N.O U is tit- Jtiermn^N? ^ \ ^u^r.es Co? N 
^K*II»NO 14 iMiL tktt*miiiod* lDN liquet* e for J 1-3 3 
sf o_ Ji> \0 1 ^ ts t\u >M<mm-v$ > ci>J\ \ j^us^c* i< t St -] > 
^1 0 ?D No 1 h is tlv J^twnowd V eO\ V tvqucaee & t JM 0 
^LO ID NO 1 7 t<« tHt V t !>XA fequ^kV tor Jl - 3 ^ 

\| 0 U^O H j« the J*+e tnin^l V tM\ \ Mqw,u * *r U ■£< 

!*F^ V D NO JIm,» dc?i^iuncd j* lDN I jcqmu\,+ K»r 
SIQ DNO "ixti J^nu^ds tDN Wquervelca V 
^3t 0 li>NO i.1 <sthf vLtemu b\i v tDN W^ucj-xe? for k! 18 
^bu ID NO 24 k 1 1 1 " d^an-ur^U 5 d\ \ if tK e t H K t t?i 
^FO »D\ t O ^ s the ^a^mmed 3' vf»N \ ^^u^e t^i oj 

^hO U> \n 27 \ iht detet^nned ^' tJ w k e tor U-4 
y^Q D \t> v .kitim ikJ ^ UA r I s^t Lrue Uw I I-U 
ii g 1 » \0 > i ^ lb? AUim iwJ -V clVS V a^ui e Ioj U 14 
5*1 H » \0 », 4 be ot tcm u si] V v DN \ t ^amct to II L 
Si Q 5 D sit* M ts t*ie dt\mii J V tPN A, «^qatJKe t« v l-"^ 
^FtJiP xO V is tk^ uetcraw 1 «-0N4 t,qut-TKe tor U-n 
<F0 ID 4 ^ »s ttj* uLterenwud * iT>N \ ^^l^w to* k 1 4^ 





SBQIDNO 


34 


.s the determined 


3* ©DMA seqi«H^ for K3~55 




SEQ ID NO 


*5 


s the detmrnaed 


3* cDNA sequence ibr LI -2 




SEQ ID NO 


ift 


s ill* determined 


3* cDNA. sequence lot LI *6 




SEQ ID NO 


57 


s the determine 


y cDNA ¥«i«eiM!e for NMS58 


s 


SEQ ID NO 


38 


is the determined 


3 f cDNA sequence fwNI-l«6fl 




SEQ ID NO 


39 


istlte determined 


3' cDNA sequence fftf N1-1S61 




SEQ ID NO 


40 


is the defennm&d 


3* cDNA sequence far N 1-1 864 




SEQ ID KO 


41 


is the determined 


cDNA sequence lor PS 




SEQ IB NO 


42 


is tire determined 


cDNA sequence for PS 


w 


SBQIDNO. 


43 


^ the determirJe'il 


cDNA sequence for P9 




SEQIDM) 


44 


s the determined 


sPNA sequence for PI 1 




SEQ ID KO 


45 


s the determined 


cDNA sequence for P20 




SEQ ID NO 


46 


s the determined 


cDNA sequence for P29 




SEQ ID NO 


47 


s the determined 


cDNA seqt*enc« for F30 


15 


SEQ ID NO 


48 


jt&. the dcfcmnsed 


cDNA sequence fur P34 




SEQ m NO 


49 


is the detennia&d 


cDNA sequence for P36 




SEQ ID NO 


50 


is the determined 


cDNA sequence for P?4 




SEQ JI> NO 


51 


"3 the determined 


cDNA sequence for PT9 




SEQ ID NO 


52 


is the determined 


cDNA sequence for P42 


20 


SEQ ID NO 


55 


„s the determined 


cDH A sequence for P47 




SEQ ID NO 


54 


rs the determined 


cPN A, seqtJCEJic>c for P49 




SEQ ID NO 


55 


is the determined 


cDN A sequence for PSO 




SEQ ID NO 


56 


u tht determined 


cONA sequence for 1*53 




SEQ ID HQ 


57 


the ileterreiirtsfd 


el>MA sequence far P55 


25 


SEQ I'D NO 


58 


is the determined 


ci>MA sequence far P6G 




SEQ ID NO 


59 


is the dnermined 


cDKA sequence for P64 




SEQ ID NO 




is. the determined 


cDNA sequence far P65 




SEQ ID NO 




is the determined 


cDNA sequence for P73 




SEOIDNO 


$2 


the determined 


cDKA seqociwe fat t»75 


30 


SEQ ID NO 




s the determined 


cDNA se<qucne« fat P7*i 
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SEQ ID NO, r.4 is iht* eksermtrrcd oDN A sequence fat P79 
SEQ JO NO: f*5 it the deJenmoed cONA sequence for P84 
SFQ ID NO ■ 66 is the rJetermi ned d>N A sequence for P6$ 
SfcV 20 NO. hi 53 flic tJc^rfmnyd oPNH s^usyce for P^O |.il«> itiVsd 

5 to as F7i4F) 

SE0 W NO: 6S is the J<rterniiiwd eDNA sequence for P82 
SEQ ID NO: is the determined cDNA sequence for lit -3064 
SEQ H> NO' 70 is the deterr&ined c-DKA sequence for U1-3Q&! 
SEQ ID NO: ? I is the detcnmtjed cDN A sequence to V ! -3692 

*® SEQ ID NO: 72 is Che <JctcnRiaed cDN A sequence for 1 A-3905 

SEQ TD NO: 73 j$ t^e d^rmmed; cDNA sequent for Vl<-36Se< 
SEQ JO -NO; ?4 is the defcartfisi;«J cD3s A seqatRte ft* R1 
£EQ il.> NO: ?> j.\ rhc determined cDN A sequence for 1 B-3 j >7o 
SEQ ID NO: 7^ ix tire; ik^raaineJ d>NA sequence for Vl-3t>7^ 

15 SKQ If> NO: 7; is the iJetermiiwi cDNA sequence for 1^4736 

^I'O ID Jm > $ht vkicmisix i cDNA s^jchu: f^r 10-4"^ 
SEQ ID MO; 7 s .) is the deiermtned cDNA science for 10474] 
SEQ JD NO: SO is ifes determined cDN'A sequence for 104744 
SEQ ID NO: SI is the determined cDNA sequence for IG-4734 

20 SEQ ID NO: $2 h the cteteansifted cDNA sequence ior 1 H-4774 

SEQ fONO: S3 is the ifetammod cDNA sequence for SUM 78 E 
SEQ II) NO: 34 is the deienmTc«J cDNA sequence for t!-M?S5 
id P ID \(> fc^ k tht Je* ttni*vJ vPN \ MqLeuce fo Hf-4 W f 
SfQvDNO ^'srthM^UnritBidtDNV^qucnion.rlll 

25 SI Q T ri nj<j st < ^ ^ l rt^Tirjirtf 1 cDN \ «*qikrMr \ r 13 4^^ 

^Lo Hitsu iv, *he A'tmt^tdA \ ^ t_tK.e t *r U 
S\ v If ' NO <«o ^ the J,.'*,- n .rwd oD\ * !*quc ice tor r-5«l I 
ST Q ID NO go i<c the ck*eim t c * vONTA ^cwra* tot U 4S""^ 
?J0'^N0 ^ iht k . temped v-DKA *sq«vTce IK ^Sh4 

30 Q 'i» NO ^ i tH«r i?«tennKTWt c^N \ vajuercc *or IK 4$% 



wo mmtm 
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SFQ ID NO ^ « the deternired <DK V wKjuerce for I G 4 ?t>l 

SI<Q r> Nt * «4 is tbe lieterwjrwU tDK \ ^acmce for 1 G-Pt>2 

SLO ID M > ^ u tht detenriml cDN <V oversee for 1 H-47**6 

SI 0 ID M > % u the dctttuMjvd cD\ \ .eqjew* ior 1 i 1-47 70 
5 \fct> U> M » i„ the dcttrrmed cttXA ^ence for 111 4771 
SFg ID Kt 3 » the dct*n~ir«d cDfcA i*\p.erc* tor 1 II 17 ?2 

NFg ID \( » *w h tihr dcttnru^d cOSA ^ «k* tor ID 42^ ; 

^Fg ID ** O UM w tbe deierrninrJ cP*i A ^v^uoikc ior 1 0-4 1 0"* 

c l g U i NO MIh ihe OfcieTrniruJ ct v^uuw tor IP \-4?7\ 

iG ^ fcO 10 NO \-Q the dtTOu-ed lHN A we PJ 43** 

al-U ID NO hh i± the dtfcTniwsi tDK \ ~ ^ jmicc t^r ID 42$ * 

SE 0 ID NO 104 *s Ihe d* te-puwd cDK \ -«tjc«kc tjr W 4j04 

SI Q ID N* * I OS s> tk- de&jrmwai * D\ * ^unce f<v if '-4?% 

SI O ID M ' 106 u tlx, deaawmed cO\ % >eqaeice k»r 10 4IW 

IS klQ W KO v ^ cetrm wd tuif iznath cDNA ^u^cc ic* H ] J 

f i-u ID \0 1*^ Jts vfce p*teJscLsi axs-it^i a^id t 1 

SH(J» ID NO 100 ? s the d^nwcJ Ml k&pth cD\A .-^ujacc Vt Jin 

SFQ ID NO- V>0i^ tbe J*,ienrp;n?ed %!1 length cl>h ^ ^querwe \*i 1 1 I? 

20 iffewd to m FMl S } 

Sl-Q ID V * 111 a ihe detwrnuitoi t^H ksjigtii vl'NA -e^yrw h x r \1 
1 162 (alsn Kfcred t.> » P^t^S> 

^Ft^ K * KO 1 1^ is tb s prcti^tcvl amtpo acid -ic^uif^co if 

>:} \J II> NO llhi UiC pre 3*kvl ,«ninu ^o±d se<f^-t>cc tor I 1-12 (jjvo 

25 referfed to as F50iS> 

Si 0 ID r«D * m r trie p^cJatc 1 aronic ^ic* scoutTc^ tot Nl-lfet*? uW> 

SFO JD M >' 11^ is ihff Jrtcjrunrd cD\ \ ictULQi.c fi»f 
SEO ID 1 1 fu , the dottn^j^f d clX\A ^ucoco for Y**) 
30 SFiO ID KD n } u ihe dctcrmiwd cDN '\ w^utocc for P<>? 



Ik. 



W0 



SEQ ID NO: H8 i& the dwerrnined cDNA sequence for f**5 
SEQ ID NO; 3 1 9 is die detcraiiritjd irDN A seqiiorce for P9& 
SEQ JO NO, 120 is she d^wniinea oDNA sequence for PliE 
SEQ ID NO: 121 is ihe lieifcrrGiacsi oDNA seti«i*r»e few Pi SO 
SHQ H> NO; 122 h\hx lieferrrjiacd el>HA ^ueac* fox Pi U 
SHQ ID MO; 12.- is the dtf$ensiizw?d cDNA ^tqu^ac* for PI 34 
SEQ ID NO; 124 is the Jeteraiiiwd cDNA sequence for PU 5 
SEQ H) NO; 125 is the detcnniiwfd cDNA sequence for PI 1<> 
SEQ ID NO: ^26 is the detenum*} cDNA sequence ftw F 1 24 
SEQ ID NO: 5 27 is the determined c&NA sequence for H26 
SFQ ID 1^0: 12S is the detrained cDNA sequence for PBO 
SBQ IT.) NO: 12*> b the dett&wiuwd cDNA sequence- for VH3 
SbQ ID NO; I SO is the determined tlW A sequence fur P13» 
SEQ I'D NO: 1 31 is the deferred cDNA rcqutsocs tot' PH3 
SEQ ID NtX 1 33 is tte desfertajjied cDNA *<?i|u^jc« tor H51 
SEQ ID NO: 13 1 is the deiermiwd sequence tijj PI 56 

SEQ ID NO: 134 « the deterrtutsed cDNA seqweace for P15? 
SEQ ID NO: 1 35 i* the determined cDNA wsquence for I s 166 
SEQ ID NO: i 36 is the de&mniaeti cDNA sequence for Fl^ 
SEQ ID NO: 137 is Lbs? determined tDNA sequence for R7& 
SFQ ID MO: 1 38 is Uwr deterrained eDN A ssqueuce for H79 
SHQ ID NO; 1 39 b the determined cDNA sequence for 1*185 
SEQ ID NO; 1 40 is the detenrufled cDNA sequence ft* PI 92 
SEQ ID NO: 141 is the deterousied cDNA sssqufcecs for P201 
SKQ ID NO: 1 42 i 1 .; the defcrmatved eDN A wjqiwsu&e for 1*204 



SEQ H) NO: Hjii:tt*.ideit:nv 
SEQ ID MO: 144 is the detenu- 
SEQ ID NO: 145 k t;v .i-.-tm;) 



rted cDN A ^qtven-ce for F2E)R 
oed 1.DNA sequence for P21 1 
«ed cDNA sequence for P213 



SEQ ID NO, 146 is the determined cDNA sequence for 1*219 
SEQ II? NO; 147 is the dteteriained cDNA scquease for P237 



mq IB NO; 148 is the detvU cDNA mpmm for P239 
SEQ IB NO. 149 is dw determined cDNA sequence for P248 
SEQ IB NO. i 50 ss for determiaed tSMA sequence for P25 1 
SEQ IB NO, 151 ts ihe determined cDNA sequence for P255 
5 SEQ IB NO; 1 52 b she deleramsed d>NA sequence for P256 

SEQ ID NO, 153 b Ow determined cDNA sequence for P259 
SEQ ID NO- 154 b Use d«tetmk£d cDNA sequence for P260 
SBQ ID NO; 155 ss ihe JetermmeU cDNA s-equerice for P263 
SEQ ID NO. 156 is (he determined cDNA sequence for P264 

1 0 SBQ ID NO: 15? is ?he detomked cDNA sequence for P266 

SFQ ID NO: 1 58 is the determined cDNA sequence for P27Q 
SEQ ID NO: 1 59 Is the deiemuwed cDNA sequence for P272 
iCQ 10 NO: 160 is lb* deterauned «.D* JA sequence for P27S 
SEQ ID NO: 161 the determined cDNA sequence for Pi 05 

1 5 SFQ FP NO. I «S2 ifw ' taegxnuusd cDNA sequence for P 1 0 7 

SFQ FO NO' ld3 i*the determmfcd cDNA sequence for PI 37 
SEQ ID KG; 164 is the determined cDNfA sequence lor PI94 
SEQ ID NO: 165 h the d«(cmiiued cDNA sequence for f\95 
SbQ !D NO: 166 & she detemutjed cDNA sequence for P J 96 

20 SEQ ! D NO: 1 67 is ibe determined cDNA sequence for P220 

SEQ ID NO: 168 k ihc determined cDMA sequence feu P2J4 
SEQ FD NO; li?9 as ibe dcteimbwxl cDNA sequence for P235 
SFQ 10 NO 1 70 j> ihe determined cDNA sequence for P243 
SFQ ID NO' 1 71 the dcterm^J cDNA sequence fof P703P-DK1 

2S SFQ m NO- 1 72 j?eda*ed vimmv acid seq uence for P7G3P4JE1 

SEQ JO NO: i 73 » &c dctemttned iDNA sequence for P703P-DE2 
S!*0 m NO: I ?4 k the determined cDNA sequence for P?tB P-DEt> 
SFQ ID NO: 1 75 » tfw Jctmntned cDNA sequence for P70?P-DE1 3 
SFQ i 0 NO; ! ft is tf* predicted sraaw add sequence for P7Q3P-DE 1 3 

m SFQ J D NO, 1 77 « she determined cDNA sequence- fur P703 P- DE \ 4 



wo mmsm 



w 

SKQ m NO: m is fee predicted amim* acid ss^ueiace far F?0SM>E!.4 
StQ ID NO: 179 is the Jewmuiwd extended. cDNA sequence for IG- 

^i\> IP N lO tjtO * the >iKfmi*k*d e\*s>kte'* <.m\ ^wwr to* Hi* 

SEQ IB NO: 181 is the determined extended. cDMA deqwaice lor 1G~ 

SEQ ID NO: 182 is the Ort«rmiiKd extended cDNA sequence for 1G- 

13* NO 183 is tSw iku-rraiivd cMcrsted d'NA sts-ju^ice (Vh IH- 

5HQ ID NO: IP is the Ustemuned «xl«rcied cWNA. sequence for 1H- 

SEQ \D NO: \tS h the detansmat-d ratended cD'NA sequence fee ill* 

s: O ID "NO m -s fre det-i«auifid ev;?nde*i cT>N\ ^uence ior 1H- 

471? 

SEQ ID MO: \$7 is rh>c <kfe?rttuiarei extended cDNA ^qiscisx EW 114- 

2f Ut-Q ID NO: 1 S8 is the detennsned attended cDNA sequence lot U- 

4807 

SFQ1DNO: 18$ is the *ae«ninisd 3* el>N A sequence ft* 11-48)0 
SiiG TD NO* VMS h the ifctemmwcJ T tDNA sequence for II -4« U 
£SEQ IP NO \*\ I- ^ dettrrnifced evtawkd ei>NA <Tt^ee fur U* 



4736 
5 4738 
4741 
4744 
4174 
4711 
IS 47SS 



25 me 



SBQ ID NO: 192 is ths deaenBirwd extended cDNA sequel* for IK- 
SHQ IP NO: 1<*3 is, ibe d^maincd extended cDtfA sequence fw IK- 



l*Cf/BS01i»lS?4 



I? 

SEQ I'D NO: 194 te fee idsamk^l extended cOR4 seqa©m» for lO» 

4761 

SEQ ID "NO: 195 is (he determined extended cDNA atsquesice for 1<3- 

4762 

5 SFQ ID NO. J96 i$ the determined rxten-kd cDNA serene* for tH- 

SEQ ID NO: m h the cteitnnimwl 3 5 cOKA sequent for IH-4770 
SEQ ID KG: 198 is the determined 3' cDMA sequ^ee for U1-477I 
SEQ ID NO. 399 is the determined extended cDNA seqaeace for 111- 

10 4772 

SEQ ID NO' 200 is the «letcrminai extended cDNA wqaencc for ID- 

4m 

SEQ IB NO: 201 is the detenairied extettded eDMA ie^ncace for IDJ- 

#271 

IS SKQ ID NO; 203 b the determined «temkxf ©ON A senses for ID- 

4211 

s^RQ ID NO: 203 is the dcr«nnia©d emended cDh'A se^eare fif ID- 

4283 

SEQ ID MO: 204 is the determined extended cDNA scQueai&e for \D~ 

28 4304 

ST\Q ID KG: 205 is the determined extended eDNA setpjejaoe for ID- 

■ 4296 

ID NO "Nte t*. d>e detertmne-ci ex*-emfrd cDNA seqistjRC^ fnr ID- 

42S0 

^PO 11) NO JoS ^ tkc d^omsmed >J fSA s^quersoe for iO H hkait 
^PQ ID Ns * >^ ^ the dcferanseJ >J>SA ^qwm.**.* tot I !-OKrt'\ 
O ID NO ? \ 0 ^ the cteseronr^xf k 1 >NA ^wmce for 7 {.■>,(> tw J 
SPQ !1> NO 2H is deiertturjeJ cDNA ^itwc for 7,^ev 
30 SFQ ID N( t the ifcarmuwd cONA< we fW M<5twtf 



WIS 



SEQ IP NO 20 is the detennioed cDHA sequence for S-b5nsv 
SEQ ID NO- 214 is the dettsratned cDNA sequence for 8-Wfwd 
SEQ ID MO: 21 5 is the determined eDNA seqaence for 8*b(* tev 
S£Q ID NO: 216 is the iSetennined cDNA sequence for 8-<i4fwd 
SfeQ ID NO: 2 1 7 ts the determined d.)N A scqaeacr for 8-d9rev 
SEQ ID MO; 21% h ihe determined sDNA sequence for H'M"wd 
SEQ ID NO: 2t*> N the determined cDNA sequence (<>r 8-|E*r*v 
SEQ ID MO: 220 U the determined cDNA $equsnc« Ear 8-q3 Irev 
SEQ ID MO: 221 isthedtete^ncdcDNA sequence ^« g-112f\vd 
SKQ ID NO. 222 ts *he determined cT)N A seqamoe for g-fi'rev 
ShQ ID NO: 22* ?a the deitamined cDNA w^ueoce for SWS 
SEQ 10 NO: 22* N «r« determined cDN A sequence for P5!«S 
SEQ ID NO: 225 U ihe determined cDNA sequence tltf i'* ? 0JDO5 
SEQ ID NO: 226 h the determined cDMA sequence fur 9- AH 
SEQ ID NO. 227 \k the determined cDN A sequence i*r 8-Cb 
M-'Q H> NO i i tisc arter^inU eDWv AiKncc fi* K-U/ 

ID NO- 229 is the <jeteram^4 <DNA sequence tor JITPN * 3 
stEQ NO; 230 is the determined d)NA sequence for 4FTPNI4 
SEQ ID NO- 23 i is the determined iDNA sequence for JFFPN23 
SEQ ID NO: 232 is the determined cDNA seqiiciiee lor JFTPN24 
SEQ ID NO. 233 in the detenraned cDNA sequence for JPTPN25 
SEQ H.> MO 2.14 & the detestuncd eDN A sequence for JHTN'Mi 
SEQ 10 NO: 535 as the determined cDNA sequence fisrJPFFN r 34 
SEQ ID NO- 23£ is the determined cPMA sequence for PTPN3.5 
SEQ ID NO: 23? \s the determined cDNA sequence for JPTPN36 
SFg ID NO, t& the determined cl»SA sequent* i<* 
$EQ ID NO, 23^ ?a the. deSermnted cDNA <^uemx tar iPFPN^V 
SBQ ID NO: 24t) is the determined cDNA sequence ft* Jj'TPM40 
SEQ ID 'NO: 241 is the determined cDNA sequence for JTTPN41 
SEQ ID NO- 242 h the detcmuned eDNA sequence fer JPTFN42 





SBQIDNO 


243 


is the A'lcnr.mod 


cDNA 




SBQIDNO 


244 


is lbs; determined 


cDNA 




SBQIDNO' 


245 


is the iktetmibsd 


cDNA 




SEQ ID NO; 


246 


is ihe dstemuusd. 


cDNA 


5 


SEQ ID KO, 


247 


is the determuasd 


cDNA 




SEQ ID KO, 


248 


is the- determined 


cDNA 




SEQ ID KO, 


249 


is the determined 


cDNA 




SEQ ID KO- 


250 


is the ctetermmed 


cDNA 




SEQ TOKO- 


251 


is the determined 


cDNA 


10 


SEQ TOKO- 


152 


Is the determined 


cDNA 




SEQ ID KO: 


253 


is the detmmnsJ 


cDNA 




SEQ ID NO: 


254 


js the detemtmed 


diMA 




SEQ ID NO: 


}5S 


it; [be determined 


cDNA 




SEQ ID KO: 


256 


is the determined 


cDNA 


IS 


SEQ ID KO: 


25? 


is the determu^cd 


cDMA 




SEQ TO KG: 


258 


is the dctermiaed 


cDNA 




SEQ ID KO: 


259 


is: the deterrrenol 


cDNA 




SEQ ID NO: 


260 


lis the detemimed 


cDNA 




SEQ ID NO: 


m 


is the determined 


eDNA 




SEQ ID NO: 


262 


is the determined 


cDNA 




SEQ ID MO. 


263 


H tfce determined 


cDNA 




SEQ ID NO. 


264 


^ the determined 


cDNA 




SEQ ID NO 


265 


tathe: detenfltrsed 


cDKA 




SEQ ID NO 


266 




cDNA 


as 


SEQ IP NO 


267 


is the ddermietsl 


cDNA 




SEQ JO NO; 


im 


Is the determis^ 


cDNA 




SEQ ID NO; 


im 


?^ the ddcredaed 


d>NA 




SEQ ID NO; 


270 


5S the detorcn-fsed 


cDNA 




SEQ ID NO; 


271 


5S the determined 


d>NA 


30 


SEQ ID KO: 


272 


sg the determined 


cl>NA 



forJPTPN46 



k» tor iff PtWi 

KscfbrJPiFi 
ice tor JPiF2 



e forJPlB2 
eforJPUM 
cfbrJP!A4 



rcmmwwi* 



m 

SI~0 ID N< ) 2 * * ^> the JetuwxM c^MA v*ji & ^ tor TU< P 
Sty ">No ^ ^ k m ( nod u> I \ -l\)ix>jm t i T 1 1 
St U lt> NO i Uk A t 4 "r«T vd v IP V m-\h eiHk « U '"M I 
q ID \0 '"is \ tK» vt» d U *\A c ^axv ^ T H*1l i i 
■< IP k <0 r"7 >\ t» *. * arnu' <*i l!>\A c v to* nil ,2 
MO Nt, ^ - d\i\ rrt usd <l J *N A scsjw fc o tot J 5 \ B P 
*.F01PNO 2" stKvM*tT»*dcIA r \ ^ etxe^JP U 
SFC, IP \0 2*i> n tbir AHet i imlcLN ^ * *\ erte *» *• W 2 
^lQ: i >\0 2^ \X Z J^itaneu c*Vs:A **-t|ii*. ^ to- V PSIP 
SiOn^O ^ N IS U * i \,F5TT*od i A jcm. L W ^ 5 P 

fet-y LI NO its < s ^\n5 mi d^J A n er^t V 

SI y ')tu ^ 3 v *ht knn * *uoo c s Y4 \ ^-jm to* ? 'VI 
My, NO fc R« ^ *h*? v.k Tn^d c' *J A ^ iue*K U * J * 
SHMPNt 2*" i&iK^-Ut *tfvUON\ ^uti*' KrX^-Ab 
Spy Xti N(t s tl e ^tte fti icJ riA A ^uur n. + Ni if M)6 
sfQ/DNO ?vn<aVdeetr ne<i U>Na -^qo^nce nrJF$F5 
bltCpDNO 2°H) is die ck««m m> cPNA %qi*iK- lor Tb\* 

SF* % TO JO >l *lt-> A*tfTTft3tlcd .PV^ (*|UUWCtv» R'SC. 

Si O tte N L >k J^ H ^etc rn^cd i J 'W se^ocr^ c JP? ^"^ 

^ > ;»> 1 U ;**4 1 K^c^i,iKr *. \ Svnjl t ^ttv tM i?st- 7 
ID NO N ^ its the 4f?e juu^d cl) H \- ^snvx u i 
^Hk) U \i ^ r tk ^.iiim^d v-I iN >\^vre x . til? 

{> II N' * r i *7 > tiPC ^biro c! K K l \ ^ utn< ^ JFRU J< 
M Q V Nu 2 ^ is tii" dcVu sited \ s„ ^\ ik ^ t»*r h <C10 
J4Q'D NO < is th^ dtterm Jud ^ UN A. ittc k* tot T^ip s 
MiQ ID NO ix ft ^ J « i-tte^tmned cl \ A ^act ce *or T & Sr x 
Shy I n NO 1 t* e ^tetm^evl g">N \ ^irrec tor ? P^b^ 
SfQll NO «(P vbiieU^n^c^NAv^iK-cet^ 1^ 
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WQ ID NO:: 303 Is &s feem^d ftDMA s^we for JF8C12 
SEt) ID KG: Vi4 is the determined cDNA sequence for IPSE! I 
SEQ ID 'NO: 305 Is the determined cDNA sequence for 
SEQ ID NO: 50*5 is ths amiso acid sequence for the peptide PS2M2 
5 SEQ ID NO: 30? \$ Has detsrmkm>d cDN A sequence for P H 1 P 

SEQ ID NO: vs { s the tfe&rraviwd cDN A sequence for P?12P 
SEQ ID NO: Jft<) 3he determined eDN A sequence tor CIONE23 
TEQ 10 NO. 310 is Uw determined cDNA sequence far P774P 
SEQ ID NO. 3 U is the determined cDMA sequence for P??5P 
10 SEQ ID NO ■ ? 1 2 is the determined cDN A sequence for P?l SP 

SEQ ID MO ' ? B is the determined cDNA sequence for P"? ! OP 
SEQ 10 NO: 3 1 4 is the dciercninsd cDNA sequence for P767P 
SEQ ID NO: 3 15 is the determined cDNA sequence for P768P 
SEQ ID NO; J 3 6-325 ate the determined cDNA sequences of previously 

15 !§oMe4 genes 

S£Q ID NO: 326 is the determined cDNA sequence f<n-PT03FPE5 
SEQ ID NO; 327 is the predicted sumac acid sequence for P?COPDlr.5 
SEQ ID NO- 328 ss the determined cDNA sequence for P703P6.26 
SEQ ID HO: 329 s$ the predicted amino acid sequence for P703P6.26 
W SEQ ID NO : 3 3 Q is the deierouited cDN A sequence for P703PX-23 

SEQ ID NO: 33-1 is the predicted amino acid sequence for P703PX-23 
SEQ ID NO; 332 is the determined foil length cDNA sequence for 

SEQ U> NO: 33,1 is the determined extended cDNA sequence for P707P 
25 (also referred to as 1 l»C§j 

SEQ ID "NO: B4 h the determined cDNA sequence for P7MP 

SEQ ID MO; 335 is the determined cDNA sequence for 1*7051* {aho 
rdcfred to as 9-Tlj 

SEQ ID NXK 336 Is the pie-theled wvuuo twid sequence V70SP 
30 SEQ \l> NO, 337 ss the *tmm<> aud m^wcrcc of the peptide PiS*lt> 



wo %y$M 



SfO JD * v, tK ptcdutes* awus k -i \£> ut " * ot 

$lg SO m jsthir i: toini*vd o*\A l^* Vt 1*78^1 
§ SfO D "SO js d detent iud tPN \ -c-HTce fto I 

<i 0 ID \*0 * * * i the ^ekrrat* ^ d^N \ > eqi^ace to a« showing 
hnT<slo£> to 4o,Tv> *apt£n> MM4b tsKKA. 

10 M'Q in M> ^ v ft? uuefrrW vl? J \ st^iriKc fc« ^\>txo -Owinfc 
H>jh)1» ^> to 'So * ^krs niiv^fA toi L eadlo? $ 

si O 11 V > W> ^ tt ^ detv -mix I d>t\A r^qu«t ^ iot ,t k ^wiiig 
h m \v tv> tluraao nuUeat e»?«k\' ttototlordn I trtnt lv;tru ) Ttctta twa-ferase 

11 Si\> 11^ NO l r > t*K? dctnr «H d » * >N \ Ci « a clowr * wwiug 
honvV^ to tfo*m ^pfev. ntfui*' .^i,t.we ^x^vd wvwp!-««e |«atein2 
(NKA&IP2) 

SFQ 0> NU "U \ tin? Jcte* FonoJ l^NA my^*^ fur a ciiw -howln^ 
foir^ltt^ H Uorh sapv^fWf!s>^UK^mm^ ivhM p^ir* il\>^^ * 
2§ S!~Q ID v * * 3^ i-s tH A l tut i ^ J ► H\A vjqirf. ic* k * j, <. in i* huwng 

IT NO 1 ♦'w Azitoi ^ td cl>N \ veqa^ >ct t< r ? P 
^rO ID HO ^3 1 w ? ttr*s ^ rwd v DN A, ^uerc^ for PT79P 
1 0 iD NO - " " ^ tl e iieterraro.1 cD>s A - oan* ace ^ > F 
25 St J J | n \ T t ) 1 * " is tSio Mk k l»I>N A v.\ e io !? 7^4? 

^1X» in "^^ * i*i On? dct su> K<i U \ cqi t ?M J 
S| 0 TL NO * \i frf Nil vPNa ^tiu to? P'fl^rt 

1L> NO ^ ^ the aettjw ined v.^qoeisie tot P^44i 

SI Q 10 T is *t Mem ^dOKA ^-qucik * fu P^SS 

30 SkQ IP N* *'*\* d» dvttoBiTOd d^A ^q«CTue tot F"if?P 
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SHQ II> NO: 359 the Urtwnincd cDNA sequence for P7S3P 
SEQ It) KO : 3 60 is the tfetetrniaed cDN A sequence for unknown 1 7984 
SEC? IB NO: 361 i* the detefmiaed cDNA sequence ibr P787P 
SEQ ID NO: 362 is the rfeteraimed cDNA sequence tot P7*8P 
S SEQ ID KO: 36* is the determined cDNA sequence for sinknwn 1 7994 

SEQ TD KO: 364 is the drtemaned cDNA sequence ibr P?« 1 F 
SEQ 10 KO: 36S the determined cDNA sequence for P785P 
SEQ ID NO: 366-375 titm sfeiermirted cDNA wHiuenoes for splice 

!§ ShQ ID NO: 3 ?6 i& the predicted ammo acid sequence eiscoded by the 

sequence of SEQ ID NO; 366. 

SEQ ID NO; 377 is She predicted amiau acid sequence crtaxkd by the 
sequence of SbQ ID NO: 372. 

SEQ IB MO. 378 is the padicted antiro aud sequence jxxxled b\ the 
IS sequence of SEQ ID NO: 37 j. 

SHQ ID NO: 379 is. the predicted atftiBsO sod sequence encoded by the 
sequence of SEQ ID NO' 374 

SEQ ID NO: 3 SO- is the predicted amino schl sequence encoded by the 
sequence of SHQ ID NO; 375. 
20 SHQ ID NO: 38 1 is the determined cDN A sequence for B7J6P, 

SEQ JD NO, 3H2 is the ietenaiaed fuU4eug$h cDNA sequence for 

F71IE 

SEQ II> NO; 383 is the pr^ktei amitm add sacfuenes for P7J IP, 

SHQ ID NO: is the cDNA acqocnw PIOOQC 
25 SHQ tD NO; 385 is the cl>NA sequent?*- for CGI-82 

SHQ IDNO:386 is the cDKA sequence for 23320. 

SHQ CD NO:3S7 is the cDNA sequence for €Qh69. 

SEQ ID NO;388 is the cDNA sequence for L4dU»W^ydrogcnase 

SEQ ID NO: IW ,<t the <Bh A wqurnre for J » 
30 SEQ ID NO: m i* the cDN A sequent lor ^33 & I 
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8FQ U> NO ^ rtJ? ot)N A ^qiienoe ts>f 2 -o^"* 

SI 0 NO 5 ,J I ts -fa d>NA vequera* li 'f HCf HP 
SFO LO NO "*>5 the dDVt sequence tor twjvsLiJtunuwsc 
Si-Q V* KO;«t> is toe rf^UinaJc toj apeevu^h wkMbfod 
SI g 1 1 > NO* ^ ? ^ UK ol )N A ^waaf lor P.M' 

Stg H) NO/VR i d-t vlW .*h|»CTM» ioi It* u^riru^n fcct.* 

SFQ li>\0 ' >9 iks d>NA sequrovt tt* h IGR 
SI g rj> NO- js the vf*MA m\|C£?^ <£ £«■ KIA'V>J^\ 
suj ID NH » *>i OV\ V *»4utncif t^r 2?>t5 
NhQ 11 > \*t i she tDh \ ^eqtem^ tnf 225$ * f 
SEO U " NO ^T3 s " the- v DN A ^qne x v fc tot K'^ 
^fcQ H> NV 404 is *he cWSA sequence for ?^Sii 

NrQ ID N< * *Mf> * the *V*VA * tor J ^ 

M-O ID MHO? ife ihe viA'A. ^utrjee ^so teo^ as 

:>Fg to NO 40S k °v sequence t^f 
SFQ Li *N< ► -Ut^ yi H>e i HNA so^n-to toi "it>2 
ShQ (S< NO. *3l! r tht h E*N ^ st quested iUr "C5^ 
ciFO iDKO 4 J1 rs the oi>V f V se»tusn« Jbr 2-3^ 
SH » ir> NO IP 1 1 N*A ^'Krwe Jiff ^fr* 
SFO lit VO 41 * is ihecPKA ^uemv i>r *'^5?0 
|D h< U N* ^ tV ct ?S % >cq^ui<- tor .2 i?L 
SFQ ID NO 4 ^ ^ t r A seqwene* tor 
^FQ 1^ NO 4 -3 tin d>N A se^uc^e tot 22^ 
g m NU %\ 1 the J M V fi v 22 " ^ 

SfcQ U 3 VJ is the v ON \ 3^it*nce for .^S75 
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SEQ B>NO:449 itie cDNA requence fr*r 
SEQ ID NO:4SO Es the cDNA sequence for 236 j 2. 
SF\QlT»iO;4S| 1* ihecDNA apitc iW2WM. 
SEQ ID NO 4 W i& tiic cDNA sequence 1 or 236 18. 
$ SCQ ID NO:453 ts the eDN A sequence tor 23621 

SEQ SDNCM54 is the cDKA sequwsce for fbMc hydn^e. 
S£Q ID NO; 455 is the tDNAsasquerae frrUM pronrie, 
Slty (D NO:45& the cDN A sequence for & fentwn &a*e, 
SEQ ID NO:457 is the cDNA ysqaent* fcr a knovm gen* 
10 $eq td NO;45» « the cDN A sequence for a previously ideoafied gene. 

Sr.Q ll> is the ePNA sequence for 23045. 

ShQ ID HO;4hO is the iD^A vcqueaoe for 
SEQ IP NO:46! i* the cDNA sequence for done 23054, 
SE.Q ID MO'462-467 are cDNA secjiKTOce* for k^owa genes- 
is SKQ tD NO:46S~47! are cOMA sequences for P7I0I*. 

SEQ NO.-4?2 h a trDNA sequti&efc tor PI001C 
SEQ ID KQ; 473 is the HetertMiBed uD&A sequence for * *pi«* 
variant of'P7?5P {reTencd To as 275GS), 

SJrQ |p NO: 474 the dci«mine*i cDMA sequence for a second spike 
20 variant iff P775P 5 referred to as 1 <W), 

SEQ ID NO; 47S ii the d^rmsMxl cf>NA scqurace for a thiri spike 
variaut <rf P77SP (referred Jo as J). 

SEQ ID NO: 476 is the determined cDNA sequence fnr a fourth splice 
variant nf P77SP f r*f«rr«d w as 1 9»? 7). 
2s SRQ ID NO 47? is a fin* ptftdktted amino acid sequence caewted by the 

«qtse»ce of SEQ ID NO: 474. 

Sfc.0 ID NO 4?£ ii v -^vud predated trewv acie <«v»quenoe ei^cnied b> 
the sequence oX SEQ ID NO: 4^4. 

SHQ ID NO: 479 is tla predict amiau ^id scq&encc snccnM by the 
30 sequence of SEQ tD NO 475. 
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^10 ID M s 4R« & a f r-t prcdjci^d amino j*ad ^cqucnc« srs^ccKfed bv the 
st ^ucnot ut SFQ "J NO 47"? 

SFQ IP NO fi] n d ml ^txt p^KvoJ 3Tin>,> acid c^uerwe cneovfrd b> 
the sequence <>i SI Q ID NO 4^ i 
S SiiQ ID NO 4$2 i\ a lb *d f t^duiv.d arcnro Kid s^qu^^ e eroded: bs 

^LQ TD NO 4-3 * is «i fourth ,-rcdicttd sirn^i < aad ^q^i lC uKoded by 
t£v jqutt*.* or NFQ ID hO 

SI\» m NO -J $4 in ihi frsJ H> arouse ^nk of the M ^pew^^ 

$K* ID NO as L-e PC* p met 4\fc W 
SFQ m NO 4** is f-c H H prime A\>> iV3 
SFO ID NO 4*? ^ t^e PTH -rrawr AW tV* 
SI O ID \\> 4*s ^ Fi R rnwjer WCCt 
15 "■sfco; ID SO -41 pepl its, cr^k>*.d ^pitcy*; j rapping MSidje^ 

STQ IB NO x\2 4 K* A'temmft! ebN\ wqiwiKC ot 
co^pimieiHoui^N det^mMtentJ! teyon Km- t'leauh PM}^ rcVct^f intjNf*^ ^0D4 

ll> NO 503 tis ^eteraneJ cr*NA ^qocKf of hse 
cempJcmadMof* dptartuiLtig region tor t *mti P^DN rr "nudoitat :mtjlXNiv i \I 
20 SIQ IF> NO ^"U MJ^ -iR" p«?pfiJo^ ^ployed epitope mapping 

ko^pYmerfcatT.^ {Mcmum^ r^ion toi the anU P7fFtF monooiul *nl>W4y $1^ 

fcr\* FD NO i«7 is oekttwtu it»NA s&ji.«icc the 
25 t ciTipk'tt't'ukv'Ttj ^damjjiui^ j^^h t tor the ^nt'-P /t5 monoc urit** *Kf) ,k J 

SKO Ui NO: 508 i\ ^ic tkterrafcsscl cDNA ,vs;<|w«;rcc of the 

SBQ ID NO; 509-522 £wt- psrpU<io trropksy^ %n apksp€ m&ppmg stud j<ss, 
SEQ JD NO, 5^"^ is ss msiutt; bmn of P?w3P used io miue {Wls^-kJics 

30 i^bsIP?03R 



Sl 0 N» * i?4 i tLt <ft *ttU jt igth >N V *»x w ol I MI* 

ST^ ^ Ki » .i th.* wvVJ «. tuoo -ikmi «i bv ^ O 

ID NO: S24>. 

<4 M I) XO ^ the IxiT kr;?*h tl>\ V uct ■ o 101 PJSOF 

MTQ H M* sM* *> tH ci* *^ NL^ut^ct % ? a u ^arussK *>f t*l*0 IT 
SI V ID \t » ^1 fre ^PN\ «vt »t tf* "^'i t\^d tv *r#ie v>* 
SL Q ID N< > > ^ 1 > U ib tvU ^h«h» ak« crtv V* J K ^ ^=4'^ <• °* 

seo'ioNo^Bi, 

nI f> iD m > ^^11 she -M^diuul m no jsjd ^^»t > ixvik i bv 0 

|5 ID MO: 533 , 

Si < * U> M 1 a trc t lUclt u$*H • 1 ^ ^ s^hctm K p t|rs 
Hi g l« » M> S ^ ^ ,t <o\> ^ H U-nyfi \.l \ •< ^ ^OS 

ID NO- S35< 

2(v s| O IV* NO ^ ^ ^ r*N.tlv led un rK> aw k*<u.s. aitod^i H H Q 

11) HO: 536. 

ST 0 r> \U M N ii Jv pepii it S- *7t 
\5 SFO ID W W r« < n e\teuic V kD\'4 ^ ieue to. ? I 

p ^^t*^ epcu 1 1- uu« 1 Mt^ 1 tthoi fro T J> 

SfQ Dt> NO ^"tl is i?etcrsnuv«3 c*>\A ^1 Kate a H*ue 
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SEQ ID NO: 571 is the detenniised cDHA sequence for a splice variant 
af P776P referred to as cemtig ?. 

SEQ ID NO: 572 is. die determined cDNA s«<|uesice for a rpitcse variant 
at P776P referred to as coitfig 14. 
5 SEQ ID NO: 573 is the ammo mid sequence ensued by a first predicted 

ORf of SEQ ID NO: 570. 

SEQ IP NO - 5?4 is the smvzo <*ckJ sequence encoded by a second 
predicted ORF of SEQ IP NO: 570. 

SEQ ID NO: 575 is the amino itcid sequence encoded by a prtsdic&sd 
10 OilF of SEQ 10 NO: 571. 

SFQ li* NO . 76 5^-6 as*? amuc sud sequences etK^ckd hv pted^ted 
QRFs of SEQ ID NO: 569. 

SEQ ID NO: SS7 is a DNA cd&8ert$.tts $tsqu«ne« i>f the sequences ol 
P767P asd P777P. 

IS SEQ ID NO: 588-590 are amino acki sequences, decoded by predtcred 

ORFs oT SEQ ID HO: 587. 

SEQ ID NO: 591 is an extended cDNA sequence far P1020C. 

SEQ ID KO: 592 is !fc<c predicted amino acid saqyeacfc encoded b* the 
sttjuertce of SEQ ID NO: PI 020C . 
20 SEQ ID NO: 593 is a tplke variant of P775F refcrod to as m<\$ . 

SEQ ID NO. 594 ts a sphc* variant of P775P referred to as 507 17. 

SEQ ID NO: 593 is a splice variant of P775P relented to as 45985. 

SEQ ID NO; 5% is a splice variant of P77SP referred to as 

SEQ ID NO: 59? » a splice variant of P775P referred to a* .17922. 
25 SEQ ID NO: m hf a splice variant of P5 1 OS referred to as 49274, 

SEQ ID NO: 599 is a splice variant P5 1 OS referred ttt ax 39487, 

SEQ ID NO: 600 is a spike variant oi PSCMS referred roas5l67.!6. 

SEQ ID NO: 601 ts a splice variant of P504S* referred to as 5167.1 . 

SEQ ID NO: 602 is a splice variant ol P504S K&rwd to as 5 163 .46. 
30 SEQ ID NO: 603 is a spiice variant of P504S referred to as 5 1 63 .42 . 
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SfeQ ID NO; 604 is a splice variant of P504S referred 10 as 5 1*3.34, 
SEQ ED HO: 605 is a splice varwnt <. f PWS Klbnwl to as 5163 1 7. 
SEQ TD KO: 606 is a spike variant of P501S referred to as l«G40. 
S|>0 It > MO: bO?-^ 5 are the scqucsite^ uf PCR primers. 
$ si\0 I [> NO: 6 Kl iv. the dcU'rm'iiwd cDK A sqwme at a fusbb af P703 P 

and PSA. 

SFQ 1ft NO: 617 is the wiaso acid sequent? d (te of P7G>P and 

MA. 

SEQ ID MO: 41.1 k & cIMA Mqacnc»<>£<t* gene DDSk 
10 0(1 KO h^h¥i^\tki^ a)^ tqu^k kn <y HP 

P704P. 

NirQ X* NO ^ »> ok c \ \ w,ur\* o 1 * U t m tat t ot P^S 
IS <T O 1 M ^ KvPMtqupwot nKo' it £F^^ 

ITle «5 «,PS^ ^ 

^fY> Vi NO ^ ^ «; tit d*SA. s<N^ jjm? *t <* sp'uv vrut i ot <■ 1 
RtMa-ed to m P^Xski w. 

SK> IP* V* ^f> i. 6* fi'itrM v> a >>K^ uittit uf P^s 

S^ 0 *&> Nk » ' « vIk t. & no ^ J i>tqi swis uxk 1 o\ HI 0 H IS 

25 ^ 11 NO i i <?u Wt ^ m. ad «scq ' ru,e c-ix^ev. ^ , 1 J. 

MO: 623 , 

'1\ IP \n f>l{ * t n tfl er« h vl M-nccne^ k r ^to<-tit - 
>etu u fcan t inu't? rcp- <a\e tc^c^-d ^ here u F 



wmmim 



31 

SBQ IB BIO ; €32 Is tfes mtm® add mqmenm mm&&& h$ the of 
SEQ ID NO; 630. 

SEQ CD KG: 633 b the swxiko acid sequence ■encoded by {he sequence of 

S SEQ m NO: 654 h the fall-length c USA sequence for ?7§gP. 

SEQ ID HO: 635 ts die %mxm acid Neqofimae eroded by SEQ ID HO; 

SEQ ID HO: 636 is the determined cDMA segsseric^ tor a polymorphic 
variant of P788P. 

!§ SEQ ID NO: 637 is Lbc amino acicl sequel eocoski by SEQ ID KO: 

636. 

S£Q ID NO: 63S is dis auikio acid ssjque&ee of peptide 4 fjtitn F703P, 
SEQ ID NO' 639 is the cDNA sequence thst mcodes peptide 4 ftom 

F703P, 

1 S SEQ ID NO: 64CM>55 are cDN A scqutKces. sneoiimg epitopes of P7&3P. 

SEQ ID NO: 656-671 are the amino acid seq-asuces «>i epiiopes of 

P703F.. 

SEQ 3D NO; 672 and 673 are PC R primer*. 

S£Q ID NO: 67^ is thtf cJDKA sequence encoding a» N-fcrmtaal portion 
2$ of P78$P -expressed m K mii, 

SIU~> ID NO; 6?$ is sh« smiins saud ^qyemx of th«H- terminal portion of 
P7&SF expressed in £. £»jV. 

SEQ ID NO: 6?6 is the amiftO fl^d sequence ot" ih«r M tuhwrcttiosi* 

imtigmi Rj&!2.. 

2 5 SEQ [D NO : 67 ? rod 678 are PGR primers 

SEQ m NO: 6?* ^ Sh« *rDN*A MMjuerox foi the Kal^ltHM: 

SEQ ID NO; 680 is $hs cDNA scqiKoce for the P5 SOS-C wmsfrswi. 
SEQ ID NO. 68 J is the cDNA sequence for the P5 construct. 
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SEQ m NO: m k the **un.o 4C«d sequent for the Ral2*P5lGfrC 

Kl'Q il> NU: e>S * is iht* unmio icM ti-r she f\S ] OM cuus&n*ct, 

SiiQ in Nij, b§4 is Hk* «wram> ncid .^i^nce €-r ihc P,^ 3 CM\3 svosttucr. 
5 SFQ ID Nt >: o85 * oi« PCR pnBWTS. 

S&Q ID NO; t>vl si, the d>NA p^yynuc tlie eomtruct R&);'.-P7"">'5P- 

ORF3, 

Sl'Q ID Nt>, I- Jh« jlUHf-o- &eU veqoertee the ^wtnici R^IJ 

P77SMMP3, 

10 SEQ ID N'O: 6§3 and 6v4 «k POP. priroeara. 

SEQ ID ?X<: <?95 i>s the <lctermmc<| rjrarao acid ^x|Uem^ Fur s P703P 
lag. fomon protein 

SFQ ID NO. €96 Is ihe darerioiRe*! tDNA ffc^cfice tor a P7Q3P His tag 

IS SEQ I D NO: <W ? and are PCR pnnwrs, 

SEQ ID NO. &w is tbe detertn*ntd amia»> add sequeote lbs- a I V 705F His 
tag faston protean. 

SEQ ID NO. 700 ^ the temu^i eDNA «cj*wvre for 3 WC5P His tag 

idiots pmteits. 

20 SEQ ID NO. 701 and 702 Me PC R primers. 

SEQ ID NO, 703 k the de^reimed amino acsd sequmce for a F?l IP Hi? 
tag 1kwkm protein 

SEQ ID SO. 704 the detetmined cl>NA sequence for &F7HP Rb ta% 

fumm pmtvm, 

IS SEQ ID NO, 705 ls the jsmino jctd sequence of tfee if ftk-miteij 

SEQ ID NO: 7i>6 tin J 7^7 an; PCR prinw*^ 

ShQ ID NO, 70S the determined d>NA Ktqutiiice Jot the artefact 
RaO«PS01&*EL 
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SEQ ID "NO: 709 ts the determined ammo acid sequence for the 
c&nsiroct fU J 2-P5 0 1 S-E2. 

SEQ ID NO: 71 0 is the amino acid sequence for $n epitope of P50I S. 
SEQ 3D NO. 71 1 is the DNA sequence encoding -SEQ ID NO: 710. 
5 SEQ ID NO: 712 is the amino m4 sequence for an epitope o( P50! S. 

SEQ ID NO: 713 is the DNA sequence encoding ID NO: 73 2. 
SEQ ID NO- 714 is a peptide employed in epitope mapping saidies. 
SEQ CD NO' 715 is f hs amino acid sequence for as epitope of PSO IS 
SEQ ID NO: 716 is On: DNA sequence encoding SEQ IS NO: 715, 
10 SEQ ID NO: 717-719 are the amino sad sequences for CIM epitopcc of 

psois. 

SEQ 3D NO - 7*^-722 sre the D!^ A sequences encvdhig the sequfcsees of 
SEQ ID NO: 717-7R 

SEQ ID NO: 725-734 are the amacf acid sequent*? for putative CTL 
I 5 epitopes- of P ?dJ.P, 

SEQ ID NO: 7J5 is she fiili-teng* cDNA sequcRce for 

SEQ ID NO. 736 is ihe asnirsn acid se^eRcs encoded by SEQ ID NO: 

m. 

SEQ ID NO - 73? ss the detcrmmed full -length cDNA sequence for the 
20 spUcs variant of P776P referred to as eontig 6, 

SEQ ID NO: are deters ineJ full-iengih ©DNA sequ*a«s for 

the splice variant of P776P referred to as con tig 7, 

SEQ ID NO; 740-744 are amino actd sequences encoded by I!> NO; 

737. 

25 SEQ ID NO; 745-750 are <s.raifx) add sequences enorHfcd by she splice 

v^srii ?jf P776P rsfcrmi to as conti^ 7. 

SEQ ID NO' 751 is the full length cDNA ssquwe for harjoan 
transmesnbfMie protease serins 2, 

SEQ ID MO: 752 is the amino acid sequence en-coded by SEQ ID NO: 

30 75.1. 
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Sl-Q ID NO. <53 is the cONa ^eqwenee efiuculs^g the tlr?^ 209 anuso- 
acids of humatk tTATi^meiTibrane ^rc. feax ^dne 2 

SFQ 113 NO ?S| is shs frst 2^ aruruj acids cl' twiRia** ^vsmembnme 

3 SB,? MJ 75* ts the oumtve acid 'sequence of pxptife 29ii 512 *tf 

PS0I.S. 

SFQ m NO' ^6* 7>u arc PCR pnHiers 

SFQ iV» 5 i* ih>- dnermried eDNA veq^-acc the ^ b chair* of ^ 
I s ^.tll receptee to± tire J' 50 tS-i-p^Cifk T *dl <J in*.- db*i 
lu SRC in N f O " f >f itw ^iefL-rmiiicd d">V\ ^leuof ^ *Jk Va dwift <d a 

1 cell icver^T fc<r F501^ <^vufk \ cell c.^re 4f5 

U> ^ the omuri^ ^quemv cnuodod ^ ID NO 

i>EQ ED NO. 76} the amino ^dd .ve^ei^k'c snowfci S1?Q IP NO 

15 76!. 

SHO ID NO T o4 io #k rull-lvt^Li open reading Irtish Co? PVfc&P 
awlwimg stop codon, 

SFQ ID NO, 765 is the tWi*kii£Jhi»pen rcatii&gt Cn*w for without 

20 SEQ ID NO, >& if the ammo acid ^quence trcoded K > SFXJ ID KO 

765. 

SBQ ID KO- 7&?-7?2 are the ammo acid .veaumce^ for predicted 
domains of 1*768?, 

SB> IT* NO t!>e tall length cDNA ajqucivc of P*«P 
25 SI i> ID NO T74 Oie cIXN<\ sequt-Ew <d lbs previously identified 

clans FIJI 5581, 

SB-0 ID \0 <■ ?S is she cD\A sequence of fit ojvrn reading frame m 
FS351* with blop 

SKQ ID NO 776 d^e cI>Ma ^eqt^fit:£' of op^vt rvydmg ftasue fcf 
30 1>S05P withem* stop axkwi. 
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SEQ IB MO;: 777 k tie isi-tagfe mhm stM m®®M® P835P. 

SEQ ID HO: 77S-?^S are ihe i*mm<> acid sequences of cxtmo;1UjS^r snd 
iRtracdl^ar domains of PS3:4\ 

SEQ ID NO: 7S6 is tie fiilMength cDNA sequence for PI000C. 
5 SEQ ID NO: 7§? is the cDNA ssquetice of the opes reading frame for 

PIOOQC mcEuding stop «mfon, 

SEQ ID NO: 78 S i: she cDNA sequence of the open reading frame for 
P1000C, wilhoul Mop csxlun, 

SHQ ID NO. 7&9 is the fulMe»g»h wift) acid sequence for P1OG0C. 
10 SEQ ID NO. 790 is ammo acids I- iOft of SSQ ID NO: 7$& 

SEQ ID NO: 791 is amino acids 10O-*92 of 2s HQ ID MO: 7S9, 

S-FQ 1J> NO, 792 ss she ^mifso acid sequence of an a prepro»P501S 
recombinant pmieiu. 



is oinmeo desouftion of the ihwiciiom 

*!he prosit iincm^n ^ ^ rev&eil ge leraih in &Mtpo-> m raJ lhar u\« 
mthcthsrapj **M dmgnmis of cancer partiuJat!} pev^w^i. canon ^ je^ct^bed rurthes 
bckm, UttiStra'' wmposits«ms »i the pjfNetit iFvcati^i i^Hidc hut jre not cast^e^d 
to, poivpiepfcicLs ^ifsuitfady ^mxnngcnw ptthpeptuko, foSvnuoVo*t«J!rsetJvi"H3ing m x h 
polypeptides aitibodic 1 arsl oiber t^ivb.jg agents vint^r p-t^ntir^ k ~ih 1 \FC\i <md 
iiaiKum? wstcrn ooHs if £ 1 cell"} 
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Mybiidralwn <ii Hamei S. llk.gmA, ed*: T I^HSk Ton^rtptii jfu\ Tr.itss'.itws (H 
Hatrctb & S i C£^^Liob« fc\k, 1^84). Amni^l Cell Culture iK- ^reshitev, ed , 5 ->iteV; IVrKsl 
A INr.*ctik^| OyUc to Modular Cloma^t I Wi. 

.All pultl!t r otHa\ potff^sts aid paivut afvpjcjtioiai oitcd Km hi, ^aher 
5 iw Intra, hwby mvospwaleJ b> tekt*:ao? in their ennrer], 

As* u&id tlijs sp$ci£ic;i1u>n aivi Ok* *aj>pesvkd drums, the sinful at forms 
''a./' ''asf .*ss,d "ik? K b.kkk* phvtj references ualess the orc&nt dou-tv dictates 
oidMjrwise. 

10 -\s y$ed hetvin, tb£ krra '^^hf^piid^' " ss used \n iU ccaventlottal 

sucjfjutg, is.-. .as .^j^Givc of amii>i> ac^-i The p^Upeptide-i mi. Jiimtcd to J: 
ispecatlc kngtb at the pmdueL th*Lv, |Hjpfkk,\ jjHgopefHLIes, <wd po^ins arc included 
wsOyjs ihc dHirotkut of pohpephd^ aad sttcSi icfshs ra^s beua*d interchangeably hetem 
aak.vs sficcUk^nj mdicated demise litis ierrn sis* Joes not refer to oj «\duJe post- 

IS wtprwuion m^ditkatiot^ the vpeptid.fi, tat sample ( gtvwj.y lathers, ac^lsUons. 
pbc*pfK'rylaJJOi^ -wxi the Kkv, as well as nthet rruxhtkarions known m iHe art, b*»th. 
ua^r^llv in-cunsng and rn*n-naHir^h occurring, A ^vtypeplide may Is: an eMire 
pvtit^m, a, jub^jtaetve tireirof. Psitkukr polypeptide^ ot tnt*wist tn the context «i 
this tm cotton *trv tumno ^vi-d rtsb^eq^ices com|>riM*ig epj ropes re, tm%(SRjc 

20 detoir?uri«tisits suo^mual^ <^pOjiyi.hi> tor ttn; lmmtmo^uuk proper^ i>i a pr^peplkle 
&ud r>img ^Lipmble of evokmg tw isismitsc r^pims; 

I'arik'ulsrly illustrative giolypeptide-'S of the ps^m invention comprise 
those encoded hj |x4>mickcftide iffquercce forth m aav one *d SFQ ID 1-1 U 5 
115-171, !?.Vi7>, 177, 179-305. 3Q7-315< 3^, .08, 330, a:»2«335. ^40-3?S, jtSl, 3«2 

2S aod 524, 526. 5:*0, 531, ^s?, 535, ,\^^ 5 152, 5^ft72. 5*?. y»t, 5^06, 61 K> 

?5U '5?. ^4, 77*. '7© ajia TS6-7X*, ur a soquonee Ut^t h>brhJiafjf under 

5ttodef'.ttely stntigtfJil condtU^as, or, ^Itt'raaiivsiy* yndts hsgUly atrit^nt uoi^isiiiJEss, to a 
jpt»Iyaui;1eolitle vK^enct srt forth b any owtoi ^Hg ID XUs. Mlt, t!547t. 17/1475 
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!77, 179-305, 307-315, 326, 328, 330, 332-335, 340-375, 3*1, 3*2 ^id 3K4-476, 524, 
526, 530, 53U S33> 535, 536, 552, 569-572, 587. 591. 593-606, 6J8-626, 630, 631, 634, 
636, 630-655, 674, 6*0, 7IJ, 713, 716, 720-722, 735, 737-739, 751, 753, 764, 765, 
773-776 arid 786-78$. in specific embodiraents, the polypeptides of tfee invention 
S comprise amino acid .sequences as set ftirlh i& any r-rse of SEQ ID "NO: H2-U4, 1 72, 
176, 178, 327, 329. 331, 336, 339. 376-380, 3«3 % 477-483 t 496, 504, 505, 519, 520, 
522, 525, 52?, 532. 534, 537-551, 5S3-56H, 573-586, 599-S90; 592, 627-629, 632, 613, 
635, 637, 63S. 656-671 x 675, 683, 6R 710, 712, 714, ?i5, 717-7IV, 723-734. 73ft, 740- 
750, 752, 754, 755, 766-77^ 777- 7*1 and 789-79 J 

W The polvpepskks ol' the preens invention are sometimes herein referred 

to as prossate-specific proteins or pra^tMe-gpeetflc polypeptides, as m indication -dial 
Ifaeir kkmifksstiosi. has been based at least m part upon their -ncresssecl levels of 
expression in piwfate tissue ssuriples, Thus, a 'pmstate-speeifk polypeptide" 1 * or 
"prostaie-speeirk protein,*' refers generally to a polypeptide sequence of the present 

IS invention,, or a poiynocleotj.de sesjiie&ee encoding Mb a polypeptide, is expressed 
in a substantial proportion of prostate usstte samples, tor example psreferaHy greater 
ttfaa about 20%, more preferably greater than abc.-^t 30%, and most preferably greater 
&han about 50% or mote of prostate d.^ue samples teste*!, at a. level that is at bast two 
fold, nnd preferably at k&si five fold, greater tjhan the level of expression in other 

20 normal tissues, as determined using a representative assay provided herein. A prolate- 
specific polypeptide sequence' of the invention, based upon its increased Is^vcl of 
expression in tumor cells, has particular utility both aa a diagnostic marker as well <ss a 
therapeutic target, as funhesr described below. 

Sr certain preferred embodiments, the polypeptides of the invention are 

25 MtsmasiO^uv', e <? , they rcm*t defeebsbly wttbin unimmuasAJY ^ucb as an PLl^A or 
I -cell stiitvulfiUosi dSstyi with aantsswa! at&dft*r fnnn a patient with prostate 

taneer Selecting tbi i^iunogenk xKtsviSy C2n be performed n^sng lechnkp&s wll 
known to the skillet! sfttyus. For example, aueh s&iee&s. can Ik; ped'orm^d using 
methods such m those Ueysdbed it* 1-latlow ^l-d taoe, Artiibt>tfc$. A Laboratory 

30 Afumf&i, Cold Spring Hasnor Laboratory, 19S3J., In oae 'lEustjative &xampte, a 
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n. fvjvp^u^e n 1% K in** <*bi>uul 1 n nkl -^ppi n stud ^.ortokttN* v*i\h p<«unt u ta 
n v* Ail u|? ^tmbodtt*. % Uu t c ^ 5:* ^ V t^iOSi \ \\l pobpep'uk* I t t IdoiuiJ 
sun f ther tv nmov^J jtoU ViuJ antbunjus d?* ^ttd >*>>ii y to-r ^oupk, JJ! I 

S \b wchjUi bv TiLoao 1 ^ b> tht kkttkd sLiAii^tiiu vutkjsof 

fee po \p> ftuAs drvlo h % t jk^ \ tiitu.npu.isc J H the { i- cni mk j< ^fs 
1 'rucnmiogt t>K p-i^jen 1 ^ u i *<i lu^m s ^ f suim 1 5 ■■ i ^ i m* t M^iHiu p^t**. 
ni *Jk H^ni 1 r* ThV i'-t ^ t ten Uv. n In^u Mv k t<nu ir t i po iu il \ i u\* vtHh fit 

19 Inmustv^erR pi?i u,i\ ^etio^h be iA i> % %e3f ki nvnn «\VuqEK^ vuh 
as i^o- h ™?U'iv u^i m I k5j3 } t *^h*^^/<ai M ► ^ V d t, ! , j-i , -24' t tk^trt Pre 
f^+l N -H^KtvnvO est d t*uv> m ^u.h hK ^kj, mukdv ^cjvM!l..i£> pt^itpt tL Mr 

c^nis. \<> 1^0*1 k Kftici ant i<-n njvJ rk^odit ■> rt Vtfiy u- pvU 1 ^ H t the* 
15 'p^Tfvf^ hn>i h ! m vSmiw j the} v>m*j tk > ot^hi *t% ji J i IS i t*r ^Ucr 

Its cab x |*-t^vTOJ ou^KHUmiimt, ^tt iuimuiK^oikic |Kwti^*j i^f ^ ^hptptiJe 
the fi>f^rt>i! nf <or t pelt. hi th<a reac^ wth ^rti^vrus a«d.<. t l"-Qcih ht * lev?\ ihn 

20 is nt txd^v It^v. th«i the u^wtt-v ilw ^tll-le^tb poKpeptuJc m ^ 

the mtTnu^s^^^c ]h>Uimj j> 4 fc^b^ about 'H^'V^ pu^ci^ir-h ^ ki.^t ^^yt 70* > -md m^t 
\n\iet<ib-h gt^itrr thins jbnnt 'Ki^i rj«- ?mmbf\ii^yictt> r*>s tulM^th 

iuno , a\c! mnnnuKi^JiK tk*tviu ^ivdtejt duu tlut ni the corsTtisfK-md issijc fiill-t<figE3t 
fi^rpspatV, * s', h^s«n ^TOikT thai 1 * abomi 500'^ or l^tPo »s it^tre imtauinogeuk 

itiulbsJii pepi&kr m Vth.it \ *m\ tett;^^ le-^bs soi|UfJKt uhl/vf tjiiEisnR-fftbrtins' dosnssiii 
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&K<xot i. ten it&al Jtkiitu « i MO mo ind prctefahh > *narh» -uikt 
amative to flB^aiHJtupe- protesEL. 

ki unothci ^uliodiiivR^ i jv\|:x^>bdt ■campt^si^GD o f its n\L<\t<& «i > 
^Lji owpme 1 *v tuot? pnjtpeprd* 1 " tiit "mc miiFOJWslogKiiTj resits "with t uUa 
s jflof ^V\i*i% generated sgm^i p»hpAfHid^ ibe mvettfnn par 1 ut.hr U <j 
j\^j\p'hk hmr^ antra -tquen a ,W!*n>e>* Kn^ti, or tu ^ mimi^^ 

hi a>i\<t set en=\\{ if^n it tit LV imu tio t, ivih peptides ptovi/ed *t it 
o<npi v o*. o-jic 1 1 pioiv Mb *wn*iJt ^mIlK <^\ihk * elk itarg; I ct 1 iLia o* a*too<L^ 
tott nmnn*r sfogK.ii y rc&ctm vutlt <ur cr ruo^rc px>lvj*>yiides d^ov xwn oi 
^r>e or more |mh peptides sailed i>> eoetigiWii?. mtUeie <.vsd sequences run^ru^d us 
the ptMv8tic!ei*ade: -*eqiieait.£- di§iloW hcteitf, at mawixn^m^ trj^tntois >vr \^r^i.t\ 
thfiieu/, x*r tti oae uf ni^te ajcleti, acid ■.eo^nct*. vshith h>luidsy*j to ont, oi mure cif 
tivw se^kTC*^ t^nkr e^rkkt^ns? of nhnkut^ t*> biph btrtn^enc}, 
IS T be pr^mt eotb »r„ tfs os* ^her ^pwt, p«w ide^. poH peptide fctfcsmerits 

coiitpnsaug ^ k-^afishuiu 5. K\ I5 i 2u 2^ mj, ot 1 00 *xHstijguaus anuao jcjJs, at moit? 
mehidiKg all ujleiirodwtc lertiilhs, of a pohptff tide composition *?t't torth hcrenK jitch a* 
fJW «*t forth w sro IUKt> flMl^ P? f | »*, -,2«*, i^i, iH 
*SW, 477^^, 4%, 5t>4. V\ \\% Oj, ^22 3 ^27, ^34, 53^1, 55"*^oS» 
20 5S^ 5^^ ^(1 ^>2, t*27 ^*», > $33, 617, 018. t»5«* «>71. t>"»5, 683, 68^ ?10 

H I, "It, ?i? ?1 r 7t*+. "^i ^v-^k 752, 754, ?;? 7g> 

7^.^-7^! or thvjse eocodeiJ h\ ^ p.^>nmleoi'de a«i|u*fRKf sot forth ^ seqiK-five nt ^FQ 
iD NO* l.iil U! 4 171-17^, 177, 17^ 10^, ^)/-M5, »26, ^30, *U U5, ^40- 
:>75, ^^2 and W 47A, ^24, ^26 5^0, s^l, ? *^2, 5<^71, '.87, 

25 m MH>, At 8^:6, d30. *K c-^, ^\ t*74, ttfft, <j8I, 71 i. ^ 7k>, 
W Ht, 753, "W,7T3 /7*find78^78ii. 

^ another :±?peci, the pies^nt invention pTvsvul^ vanaj^s i>t ths 
pkii>pep^tde v;on*pOHt^i^ d^enbed nejeia, Pol>pept^k ^^fimtjs. gener^H^ 
txfiCinnp^ed th* present Jm«Tt50n will npj^alJy eidxibit <H l^^it ahetf* 70%, 7?^ g% 
3S ^3 a o, 93%, ->2S, 97 fl 'tf,9S%, «f ^-uvrmsiw ?^at«> 
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{dsktrnmsd us described below), dong H> length, to * polypeptide seqi&siee set forth 
hefek. 

In one preferred etai*ndime&t. the <|fc*hpt:j>ti>Iff fragments ,md 
|3sv>vaefeai by the. preset umxmoD «kre rnimuadfogkalty reactive wkh an Kjiitsbs^dy aad/or 
S T-eeH that reacts whh a fuU-Iength polypeptide specifically s>et ItsrtSi h&re^- 

I» another interred embodiment, dve polypeptide fsagmest^ and varbRts 
puivideJ h* tk' fKCMtii* imuttaon r\UHt ,i lexd <>l hmuuiti i^aic >*cti*>tv. ot it V&sa 
about 50*>> + prelenfcly at least ab<nit 70%, cssjd most preferably at least about W/a ur 
aw of trust exhibited by j fuy-leugth pt>iv |jep4Kls: si-q^etxx specifkaily sat forth 
W hertsm. 

A polypepticls "vanuatU" as the terns Is used hermit, is a pol) -peptide that 
tjpKMllv eWers, liom a |«^ f peptide ^pec^j^Uy la^o-sed h*rem m ooe or more 
■stibsuttiEiosw, dd^iivHis, additiusis ^ijid'ot insertions Stoeh variants tr^y he ttaiursiSly 
oc^urrm^ or m<o. It 3vn$M>*\iUy kiercei\.«e*t lui tiv^npk, l*v UK*di.iyi%ooe «k« of 

15 ih# isbovf polypeptide ski^teru^ of the ksx-endon and evalit-it'n^ their immunogenic 
activaty as described teem using any of *t t^j-inhcr of techni^es xve-ll kaevvji m Sfstr art. 

For e^ompK ^lostsutrve variant of the polypeptides of the 

i^vomioa include those in which one or uxne portions, such m NMennu^d Icsder 
Atft|5ifsc«" o-i tHM^mcmbt-me dontaio, have b«:n ratsonxl Other illus&aEtive variants 

2sQ include variants tn which s small portk^ (eg., 1-30 antirso «dds prefeably. 5-15 ansiuo 
add?.) has betsi removed from 5 be K- and/or C^ermhud of tht; n^tMse. prulein. 

Ijfs massy instances* a vssriaM will cOEJ&in conservative; sdnstibitioias. A 
\^mstavative substation'' is out? io which an *srmno a«id is substituted kn ajjftmer 
amino scid that ha*, similar properties, such that ou<s skilled ict Ube art of peptide 

25 ehx-mLN^ vtxuhi r peet tin* seo>nJ,My statute .iutl huiu*paltuc nstee %if the 
pohpeptKk 1 1>* be ml>N?fmtk^h tmuia^gfcl A> de^tibsxi abo\e t uk\is^^tk<as jiw> be 
miiiie b thtf structure of Ibc potynuebotisks and polypeptides of die prtxseat hiv^ntioo 
m\i still obtain a tbaelJODal molecuk tha^ er^sxios a vmwit derivative polypeptide 
with desirable charaaeHsiic?, e.g . with insmiiougeuie eharaoterbtie^ Whea it is 

30 dcsuetl tv-> ahs-t tie Ammo jciJ ^xpiesivf tit j pofype|>bde Kj t-reate si* {xjtsn^lent of 
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t-\en at* tmp^ve^, immun^&Enk variant o* portiw c*f a polypeptide of the mvenis^n.. 
i. me listed in the art wiJ! typkaM> change *>rac t>r j^nne i*5 the cqdoas *>f ihe en^tniin^ 
DMA sequent according to tsHe L 

Foi example, oeitain ammo adds, may substituted for other aiamo 
S icuis in a protem structure without appro; tsbte loss of interjsrUv? busdmg c*sp»mt> %vrth 
structures such as, for tx\smp^, ^tigea- binding ns&io«^ «ntiixKti<K i*f binding £ttcs 
0J3 substr^k m<ik\^te« Since; it & the w^r^ne- c\apadf> and o^ure i*f ^ fwoteie tb&t 
■d^hp&i that proJ<Men\\ btoJis^ci! ftaot tHii^ <*otHd*.., L**sl&m amH^ and s^jjcklv 1 
iLtb^iituiiom e-sJi made in >i ptofc?sn aequ^ee, and. ecftjjse,. its usnkrlyun* DKA 
10 cudsag f-equeitce, &nd nevertheless obLitii a protein with like pn"i^i(k\v It thus 
■■"^rtero^latc'd ihat various chsn^cs mny be made tn (he peptide sequences of (ite 
-i^lo^d co^F^ttkMts,, Of £OTnespof5Klu^ ON A ^i^u^cvs whiUs etkx^fc $#kJ pfpttdsrs 
v\iih4)yt ^preo'tible Uk® i>fihtk bh^op^d Vilify or stttvtiy 
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S ow-ic^em* TV ttfcy&rtmct? <i ?1k s^fK-psnhv artful 4u*f wn.V\ ir , niiternm> 

hviruiathic OKirK'toi il* *i«;<n< oa>l ivmreKt^ n toe ^«.onJjjv "taKtJir > i the 
iOsifUmt ^dn< which i ft taw define the mttiarftoa tf tht ju^n v,vtb ^hcr 
10 . mckcul. s, ^^i s ik, er/Mt^s, >uW^ir-~, kvo^rs f i,\ \, -nn^Mi u v ^i^ns saJi 
ihe bfrs. t'isKh iunhi** add ha* been ^guirdl a hydrvprufeic uviex -m ihe fcaafc its 
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h>drc*phi>hiest> aad ckwge chajwktfeissifc.ii t.&.\te Riyi thyollttis, Th&>c vahics 

' ><4 'kit s * I v i ^ifk^ 1 ^^ 'it ux v +* i!\ i^^ii'viK { t- 1 si <wutk. cwiye 
\$ r^thn nn 1 1. I ' > tofRjre {-H H> ghvir-r ( 0 thr> oniae, ^ o i 0 hK 

kwpt^Ai^ { tstv^n*? 1 I 1 >), ptoYi? ( ' (n h*<jj t d '3? « ^ ji a v^te- i * ^1 
5 gjutm ni. { ^ N i^f^c ^ is|>u k f 1 ^ I- -. r^L j 1 tL^ mine- { 4 ^ 
i su^u* n 'I 1 *! .ir' th >ii .mitt o ic id , m t i\ he v Kt H <e<t H t 'her 
,mtit\ auk* i * i ^ ^ tmtm n^ffopaJ sl. ift*LA <. ^ su ft *jJkT \tatt if* =j a pm-fa? ^ sili 
m^I i' b-n sigkai «ctntt> »■ <-tul t>U<ui 5 KoI s t> hLiti'MniJh ■KijinUikL* j r urn 
('i t u\\ ii L ^ > i vhxi^ s, t*k x^vutio i k't ^hums d wltcc Nk* * padm srui 

irt. n-, s-l i ir«>.uH iv prc^*^ J* t-. tix» i«h et\UK 1 in % rt th ** tV ^il stir iIk 3. 
f if kki Mnrto t< J i ii m, Jt UkLtnJ 1 n>i tV ot 1 ^ At^ LlII 1 s Pjtuy 
4 -.^d 10} {<pK tu*.i,h uiko» v v UtO Vimj? I>v i kj-tiu^ in N uUi^> ^iHi fl. ^ tot 

15 it kl t t^.L s ii. ik* .luds ^kvtr^iaj^ vnU* a hk sgj^id ^ ^pt;it\ t*l ti 1 1 i 

Waukdol ^ Patau ^^i^^LH,* to.k^ \di^^Mut\ 
Vm a« -tg^s *o iffi' i) j ?d tes vHk 4 Atga^nc Kvnc ^jv^m 

ii h, ^lu%mile ^t* 1 0 - \ ^Liin<i x^^T) ^uftit*. i^) ^ >. hit iiMt v ic-. t ^ 

^Ulikfi'^Ot d <-k M ' < \\ pK^iUtl -0 ^ t" ^ dlfflUkM ^> i \ xllSt^k 1^ t t\^ttd*t 

20 t-tli H'ijtljJikS.'UK {- > ^ " J\i>? { * lok=ix ( ' Si ^oJcuktnt i 1 ^ nui^rif i 
"* ^ p^itnvH! ^ufie { *>> txvpti pban (- ^ ^ k H i 1 U5\tc-^*XH$ o wu» ^"k* kai 

i>i *> yiuiU-y ^k{un seat dnJ tt rurttcnS tr ^ innim> 3 "samlS n\ ilt.il f n it it Iti 
^vjcb hdn£}<^ t iv ivjb&tat^tkttt of ai^ino ^liis ^li^ ^, liv t vph. hi.u> \ 1 jiv wtiluu ■+ ? 
25 puh"ed, t*"^Jk v* thin i 1 >.i^ paittcuK "h prtk-md itvI tlh^c ^ itim tv* ^ ^? k. even 
m we pvjrtK ut xrh yrt. & m<& 

\* *>iithn«sj awjv ^n5'k i it* MsMj&it ^ t » ^ *h^Tfffm b^\1 

o i * ? e tdAtn-e ^tm far of tbf AJt i »> ^d^-kham tulh^ ti utt,, id? ssxii-npk *bcr 
b^ dri^ph^b k.itv biAx^h t-M i.fwg'C, c ( Aftd %e hi** 1 xeni ' trv MiKt'ttttk'n ^ th\i 
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<4 Mh- art i. si j\lii,t wgr? rt i d wax U taru*e j*)d nnnt aid 

thrn.r jne gb rr im ^ 3 i§ <ie v.*ul vi bnt cine ^ d euv uic 

In <H\inu *u% pohf^c <s ttdi rm be firlr-er t»^*i«u tu wwat* 
*ab'hr> t ^ * f\> ^ rxxl f cacu^*. r<Uad^ h ' n r> ^ intW* t f *1 ^ tudrto 
5 f i 1 ^ i & seqEX at -.he s -H or ^ t.iw<> J t; i or p&x ^r^nifnuvr? o v f> =, t'h I 
~v be L\n p»sh>uM(t,i? "im^es it V S^J-bose, adf ♦*« i L 10, oi 
ror^TXktnc al base- 11J* a n-osnc ojeos v 3i,d ^ x i ts>su -v w t n n ^cr t 
-H.fi x - tW v^id < 1 „i ti* ti<Ni , -rn cr i-^Jd^ u r<rt\ tNnnac ^ J 

10 \riiifl ^kid s ^-t'tj^nju n» * \ n\dk. > n tli^ M < at mhWw>< 

<i pt lmit\ t,' Itoftv r f>phQ>„i h>J n*uVfv mj e-± t'p ar p i fafhc 
nature the io i^je-, }o tA*rrt> ^tjat-'uh dasjvd >«iru su, wlut a^tut c 
i&esd and glutwtae fics^; positively charged wmmo aeisss mclud'e lysine arid STpmsie; and 
amanu acids A<iih recharged rx>te he<ui ^tup*. luung sm^kr h> dropUiiicj?> \tkkes 

15 include leiseyrte. isolefccme mS v»fme; glycine ami aiajjjne; ssparsgirjs and glut^iine; 
and H^rsne, threemms, phenylalanine and tyrosine. Other groups of amino shai may 
represent conservative chaagss isciade: (]}ak, pro, gly, gk, asp,, gin, asa, ,%e$, thr; 
(2) cys % ser, t>T, ihr; (3} vat, itc, leu, met, ala. phs; (4 tlys, arg 5 bi<; asd (5) pfce, tyr, \sp, 
hiK A Vfoiant naay w alt&riwuv&ly, contain ^on-carsx-rva^ve changes., is a 

20 preferred Krobo^itis^isn vanaat pol^peptidi;* diJDfc? frot^ a native sequerbx by 
substitution* deletsoi^ m adjitiora of five ammo ae5<3k or fewer. Variants raay also (or 
alternatively) be modtlied by, tew example, the deleritnsj or addition of amis5o adds mat 
have n^ijkiHii mil?i«r^c on the ismrit^H-.'C^H'Uy ^c^uridar> stnicmrt. -ujd Indroi^thH- 
mhxv of the pwty peptide , 

25 As > iot*<J above, polypeptides may o - o ipase a sigjwl (or leader » seqocste 

Sic N-tenrae'il end «i me f«r«tcln, which «o^jBialJtic«wlly or riist-trsrakltoniiHy 

other sequence tor c*w of ^uthesi^, purificaliyo or ideatificatioai of the polypeptide 
poiy-Mis), or tu enhance Nndiag of tiie polypeptide to & solid support. For 
30 exsMKpk, st polypeptide tawy be tonj*iga«sd ix. 4 an imraui.ioglol.mUn Fc region. 
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Whm mmpmng ptijp&pMs m^wmmu, two se^aee^ are s&kl to M 
'IdeotkaT s f the sequence of amhv jckis ui the mo -equcttcts a; the same when 
digged for massmuio correspondence, as described below. Comparisons between two 
sequences are typically performed by comparing the sequences over a comparison 
5 wradow to identify and compare local regions of sequence similarity A "comparison 
vvfedow" as used herais, refers to <* segment nf m ka?t about 20 coniigu-w po^tson^ 
usually 30 to shout 75, 40 so about 50, in which a s-etiticrKie may be compared to a 
refertfi&e sequence of the sane auirsbet of contiguous positions ^fttr the two sequences 
are optimally aligned.. 

10 Optima!! alignfti&nt of sequences for comparison may be conduced using 

die Megaligrs program w live L^sergwae suite if yomforemtks ^ftvaj-e (BKASTAR, 
Ts»c, f Madison,, Wl) using default {sardnieiers. This pmgrani eiah&tlies several 
alignment sahesues described m the following references: DayhofT, M.O. (3^78) A 
model of evohwionary change m proteins - Man sees for detecting distant rcl&Uonsrnp?, 

15 In PayhofT, MO- (ed,) Atlas of Pmtcm Sequence and Strucmrc, Maiioajil Biomedical 
Research Foundation, Washington DC Vol. 5, Sqppl. 3 S pp. 345-358: Bean I. {1^90} 
Unified Approach to Alignment and Phyfagsncs pp 626-645 Methods m Empnohgy 
vol. J SI, Academic Press. Inc., San Diego, CA; Biggins, D.Ci ml Sbaq*. PM (1989) 
r^j9/avjf:I5i-J53, Myews, E.W. and Mulkr W. CA^tK ^11-17, Robinson 

20 h it omt Cowh jhev Jt Smtw, \ M O^S?^ Aft*' i\>t 4 W 
4^5, Sresath s VRA and fsoksi R R, <t^?^ AumfWi?/ r^fWJKy (he Prmefete $ and 
Practice of 'Numerical T<ixonom\\ Freeman Press, Sari Francisco, OA; Wilbur, W J, sn(3 
Lifmim, D.L Proc N&t Acad, Scl i/SA ^.-736-7^0. 

Alternatively, optima alignment uf seqatijcos tot Lvmpsn ,m nay be 

25 c^rakkted by the local identity algotilbm of Smith and 'ft'&termra. *^#1i Add APL 
itmth 2 48i\ the identity alignment JgonJhm ot Nero^etoan and ^unveh U 13 ?^)^ 
tf.V /W. 4^,443, by the search lor ^tmiLnttv rjjnh?Wh ^f I'ear^ I ipmm tl-^X) 
r*tH- N\ui AcM Set USA &5; 2444, by ^rapoterved jTOplfrrumUtions of tiioe 
(detifithmjf (OAP. nESTFCT, fit AST, FASTA, and FFAM'A m Uwr W (luetics 
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Softwaie Package. Genetics Computs* Group iGCC}, 575 Sci*tw<? Dr., Madison, Wl), 

Of % inSgNKXian.. 

Que preferred example of algorithm* that ^ suifeibls; for determining 
percent sequence identity and sequence similanty are the BLAST sad BLAST 2.0 
5 algorithms, *vfekh are described in Altsehul et al (1 Nud Aads AVa\ 25*33$ t 3-H02 
and AJUchul <rt aL ^990) J. Mat. Biol. 215,403-410, respectively. BLAST mi BLAST 
2.0 can be used, tar example wiih lite jj&ramefcr* described herein, to detsjfmiae percent 
s^uence Mcelitv for the polynucleotides aisd polypeptides of the invasion. Sfjftwans 
for r^rtormsng BLAST analyses is publicly avaikote through Ibe Nstiorsat Center for 

10 Biotechnology. Information. For sanino sequence, a scoring maim can b« iu«cS to 
calculate the ciumilaiive store. Extension of &3v word hsts la eseh dueitk'^ sire hat&d 
wheys: she cumulative Mignm-ent score fefU i& by she quantify X from sis maximum 
achieved \^lus;; the cumulative score goes to x«n> or oelow, due To the aceiimulatio-e of 
one or more ftegative-jscoring reside alignments; or d*e end of either seiiuerscc is 

IS rej&hed, Tte BLAST algotilhrn par&netm T and X determine tfee sensitivity and 
speed of the dJignornem, 

In one preferred approach, the '^percentage ol sequence sdeimty'' * s 
delejmised by comparing » optimally aligned sc^ueiKes over a window of 
comparison of al le*&1 20 positions, wherein the porCiua of the polypeptide scqitefjee m 

20 ihe comparison, window may comprise additions or deletions {is , gspsf of 20 percent 
or loss, usually 5 m 15 percent, or 10 to L? psxeem. as cotnp&red Co the reference 
sequences (which docs sot comprise additions or deletions) for optimal alignment of She 
two seqye-n.ee*$. The r^rc^ntsge is calculated by dete-rminbg the number of positions at 
which tite uientk-ai annm* ;&id sresklue occurs m both se^iieateji ki ykkl the number of 

2S matched pwiij^is. dnklit^ the number of nruvbod position b> the tot,*] siumh?? of 
posit inns \n the reference wifueite {i.e.. the Window dat) audi multiplying di« results by 
loO kt } teUl the peft^tage of sequence ide^ify. 

Within oOwr iltu^tradve ^mbtHlimoius, a psd^TaJtidc fRay be a fitiion 
ptHypeptkfe tf^.tt cf^iMWM Tnulijplr p^pt-pdde^ as desctiltd Mut\ »t \h$ c<*ft^na.s 

30 at 3e»st oek; p^iyv^ v pljie &s -cfecjibed hcrem and an uaielated seqtteiscti, mvh » known 
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mmo? pf o^itL A fusioa partner may, fof example, assist m providmg T helper epitopes 
(m .amunote^udi tu^n paitrer, preferabV T helper ep^tOjp«x r^gr^ed b\ fautnvis, 
or nu$> ?n exprc^ sog the pn tun i \*\ txf-xt^tou ^nharnr«r^ it *ygher vieU*- tH^ "he 
RAine fecrwbnaisJ p-o*«?ia Omui preferred tu- son pAtrtm «xt both im*u«v'iogKat 
5 £snd e\pn?Hj»»on tiwott pinners? Oihsr tuMon jwt«u - be ^"evttJ ^ a* 

V uK.rti->e sbe ■w^b "it> 'be ^ohpefbde vr tv* ci^bk tfee puh peptide to be t&t^ieJ to 
iteHftM lr-irauelUui ^nparttrcm:> St&li !u**bei ration paitrvi. aKhwk diluui} i^&s 
\Uul * iX.bUtt* pun ^Km of &e propeptide 

Hi^er r^H'pqpruk -rrw ^r£r<*,3v N prrpAitrd us rag sl*r*3«rd 
10 ttv^ra^vK's roclukhrf Hsrroey u r nation Heterabiv, a ti^*on fvhr«t£\fe ^ 

cjkh dri_ the polvpo^nde - orepojxsit ni<^ be «a5»firbled beparaftv** Hr&vd ir^o 4n 
tppropmte txprsvv-ior , %«xU*f Fbt* *■' ord »f ibe D\ r \ ^^rur ertCH>dt% <w 

IS jVilv^p*nie vD-tipuT^ea* i<- hg&ted, ot n tboat j peptide 'usUi\ to tV <f u>J of a 
t)\ A. -sequence erce^n* the seci *»d p dvpep-t'de c n^vrtept v dat ;he roadie f'aroos 
ot the v^asn^s in? n pHsu-c 1 h s p^ia J* (T^r&laJmn m*o >* to^Is t ta,mn po\ repftik 
U 4- -.tans tfce bvb^icJ A-tmh *t ! bom u^vmit r^jxrpndo 

A ^epuJe K&et ic^u^uc ttay b** en^lo^ed to v^rsjatc ifes* firrt ami 

2§ polj'pcpt d£ curt jjoner^ b,* j* at«?*?rkc itifficaont to crv-un^ Jlwi e^cb p*>-vpcpt3de 

fnkh jrJo Us ksxotVarj ^vraJ tert'vj ^^1^^ , ^uth a ptpt'de Inker ^quc^^e ss 
un£o-ipor«ta' ntu the fi xson oOi%pcpt>fk ^ssng v*^ K i^d Ycftm^ue^- ^Jl Wwn m Hie 

t^^thtir .ibiht> to dd^pi h f1e\ Me t\t^ikktl L^toiraLu ta f^j disur ^rib^bfy to „^opt ^ 
2S ^t^"£id^> sl?uct ii* 1 h it touM "*iti isici with iinKtl^n J rpH< ^r* br\t h^oimJ 
potvr^ptidn^, k'inJ f ^1 iV J^ck nf humip^o^v r.r uii^cd ^Educ^ th« rriRht rcdvi with 
[)vt jx^iprpnut f^xtK>xil epj^ipti IVfi ncJ p*pi d<r lnke T ^ ierae»\» n& air* ?^t> 
in J Srj ?-o c "dues iHhcr ^e\st tpu 1 ^ ^nnro ^unJ^, »uch ^ Ths A\ i m i\ be 



41 

Pru \ati -k**/ i W K>* t \ IHv-ti 4^\"m Us 
/ \ v t No k~^> list xtk- 1 jjt<l«.j ^t.qL>j:*Kw HiJK* J\ tton I to & *0 ammo 

iuih u Irovh i Tv? ^utrtes. -tie iei|uireJ wlieta tin? lifVi stconJ 

^ KpepKVs hn<* t^v tsrCntMf N tumiksl traiw d^t* ivgioa *brt *.an h? iittd h> 
5 sepiri\ *K 1 I'sUi, omnr> av'p'tv ut >ttJt k tUh*> ra^e 

l~hi 'iii^d D\ \ ifx or ^er-xbv ! raks>) stable 

KSfx r- ok lot x. tp^y t> ot 1*% \ ^ lt>L, l\> ^' to *K ^ vOkHiu uo <p-> odu ^ 
vK fubt p<iHpkptu < C'i ^ut< larK stop .A\kvs re^istd ^ J iraiv vtnf lad 
f# *n t^rjpUoa fu itrt^soj . *r< en . >k \?Jt V t ^ i>\4 ^qt eii^t <, voUusg the 

*lk. tj* ors ^ ^hpvp«iin v s < <■> np'nve <4 puKp> ^u<L ckben 5 ^ fa-rei i 

t o* t i itirii* *i xk\a \ ivt:p<*r HaniMtN o* -~vh \n<*^\ th.Vte ittatiis 

IS ♦uf^jxuioM- ikI ^ep^m, p* 'ai 4 *v *or up t c Movs? td *f J 1 A/4,1/ 

a i"ic j. i kn\xl ei*Vklkmit Lie immtin inu^l tu h-e fvirtntr ts 

fwn mi co t posirh n «J mtihoo- Mi >hm 11^ c s.n <uK^ ^ it g ihf o-vj^^^twi 

tie t jM ti ( S baunt \ppiL t no^ ^$^^S th*. d^ckv^ ot w^kA j\ 
iPK*tati.it 3 kv - 1 * is x.^'^et'v Di kfy K^l ^! rUi- to ^ &jtYiitK,Jco4tdt 
r^gik"i *h>j* ts j tuK^qlct c ot a {>i. t >*\ft.i*,rt,t, /wsK^^'.^a N^fOirV iu> k c ^ k1 
M \ a v*n.n\ ptok^ ^ ui 3^ vi jla? ^11^* tsx^e 1 b\ 4 ir* ^^Mlwt 

^ it ^HEnM Ii^l Ntea <j l \vnK.- l j t}r i-\^mpt\ I n Htoit Application 
¥) l^^S , -tt, bkcik\ u si 'fi'uf ,! fa ,*in {]'- t 9)t hi tkhV,' 
a**. i*tw*ted h^'^a b f ^usiul i. ten \t%m^^t\ i4 A. c^"43rg 

^tau*s\^ L\pr v ^ u n^S knclb -iiuJ ltauuh 1^ o'uHe j piph^tj^ Ju» ^^n^iit th^ 
3D <nva *il 1*1 1 m ^ct?^ Viort^^r, R?> I J \ ^JiIkiKC fit 3 ^ \v- iutu + v htiti*,*^ \u 



^mim^caic ptiypcfisd^ with ■vnhfch si tu^rd, ()«e preferred tusiun 
pt>l> f^ptnJ*?' eomptt-sev j 14 KJ> C-terrraoKiJ fris^ii^M eom&jn iiSn$ to offline sctd 
tt\vi-Lhie\ to ^.1 MIBI^A. Otlw pj£%ioti Rat J! polyjftioleotkfcs ^ciKnilJ)! 
t^rapm*. ni j^a. t iibor-* . s corwciAtJxc^ V^iJcn * k» i t iibout ^^uviu-Hidos at 't> \h( 
S ah \A t>0 mjck-^tiJu -st k »*sE -iivut 1 0*> rut Krt J< :>< at ^ a * o&ou t l\ u k v lectide* lr at 
<l >t ^ hit W imtw'jik dvt ccxode jikfrt^r o* i R'K ptA^p^i? Rt\? 
pot^un w Kks t-npn** ^ n *tj\e <vs\ v r it* in nis vi>qtfc iu ih<tf 

ptKodLs a Kjl? pehptj^sd-, Mr \ ^<s>rt or «*ic*<-ofi or it\is ^om^iv u \ m * ^ * ^ith a 
se^ti^i. 1-L*1J r.^ ^sUiLko-Ul ► ^j&hn Bit u>tMr f re i ran-rt '^hatitylicuis 
10 ad,i ti n , <ieit*uj„ tjiduf k3suu*u\ M.kh du* tta bi^ U dvtiMty the eroded 
t isio*i pt°>$xp ii 'c "lot Mil-vtuL^Lf *>ru s^tO, ei *t\e ^ ^ t^ <m pcKpvrtd^ 
uvrtvMtg a nnt \? *i 12 poSp^ikoV V ^ snts i *> \*<ihJy t>hbti .j.t lr.kst ui TO*"* 
h*tutirv, <thw ptx. t_f.ib\ LiM Aiiit $0*" H iJcr irk* n< s* prttetdh \ Ic^i iKuai 
■tt * u i Jk , 1H} to a h skx "ooaiit .\.tt. -l. kv dut ^r c* -d > imt^ c *i it ^ j oh pq tkV Mr \ 

U iikn>n b i)htr witir J ffliVl n t <-ti ^ »,rsm nol ^ 1 3< 4 h p^&nz* t\ 
n^t lRt v i |\oicui D H ti ^u.t\iL«. ivio'tu oi dx ar ^ m$&t^\. b^^^iun Fk'h^Ki m 

^ioro\ nnk H th<? tr^ tlu*d oi tn priori h ^ t v t hr^t N-t^r^rA 'ihK *0 uli^h 
20 ^Ki i a pro-it >n *l Arini «e tu\ x 'ipsdittd Witiwi tuttn fKt^ned 

cmboditr'eriv, tbi rir^* 10^ audi t o* j\i ottiL LT* *v ■sioa p^riv^i ^ i mUudcU ti c 
terrain s to pro*. kK rbt polvptptadc sdltOfm x \> v^ 1 T-ctU «.p tipi;^ and to 

it\K^ the tA£T4*\-3 ^ n * s .) r i f nu- i\*rkt *n**ig n n txpnes^ oa "tAiJHv-V 

fV hp J t.rl t^u^s ^nral p^cftat^m it (b^ .mLgui to ifi* pr^sait*n^ Is 
2J CHhtf Kin pirta uiwbit?? tbt '*sn^*rbA.t»4- 1 prD*ciia Lvnr j^itu^r-'ao \t js. \Si 

ir^i ttidt £ k i 5 dc i ^ >ito 5 ^ tis* be u^d 

^1 a-QOti^i, ufiksLifte.'st, the tmnu-ol^ fi^mn patltti:? ts iK 

30 den%t?1 horn u^^vp*.^ cm prh**i*iiw v>h>\h v,nth£sit£i> m N wt\i 1 ,iknt«e 
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maklase kflowa t Hi- amsdase iY fA (erxxxk-d by tht 1.>-tA gctte; 4.?:263-2£2, 1986}., 
I VTA is an auioly^i thai speuttejily tkux&k& ccrum ht^nds h :he peptkiogtyoim 
biitkis^e TIk C temihisi dmmm of tbss LY1A v-x&mn is responsible for the affinity to 
die choline or to some- choline analogues *;tsdi as Dl?AF, This pm].mty has been 
5 expkii&d tW fhs.* fS^vclopsnent of E. c*>Ii 0-1, YT A ^pressing pkismids -useful £<>r 
exprfritsMrfi of fy^jofi pt^irs, Punfkvidoa ul h^brkl protdn*; tonUijrog the C-LY'TA 
Irajiiise^l at the asnsno itrcninus ha? beea described (.t^- IhamAmiogv 10:795 ^79%, 
I'Wl). Within a pj-eiwisd etttbfKJimast, a repast portion of I.. VTA may be incorporated 
i&to a fission pofypepiide* A repeat is found in flic C-terminal region mailing at 

10 r$ssdi>e 1 78, A partkukdy preferred repeat portion ijKorporates ressduea ISS^05. 

\et aaofer iHustntnc embndin^ni unolvcs feiort pdypeprktes, 
the poJuuvdeo^ilt^ w^mg tteos. whesem tfie fusim* jpajtwer ^05nj>n$.<»s a taijiettag 
signal enable <A ilheetbg si polypeptide m the endososrially stomal c^rcipiirtmejU, as 
tH^.nbe<l *n I \ PaUV ^ 2 J- At* iror^wsef « poKiM-*^ dh. i^etitiMt, 

IS whwi fu^l with thJSs Hiy^Smg s$£mL wit m.-t^uiv* more etf^^mh wth MHt class 11 
mcrsit^ciiles and thereby piwjde avhauocfi to vivo stimyl&io-n ol'CIM* T-eells specie 
i»:r the i^h'pMspide,. 

Polypeptides af the. mvmt'um tire preparer! lising &\y of a vaiisry of well 
knowa synthetic and/or recombinant techniques, the latter which sue- further 

20 4^cnK*a HrkAs Puty^pud^ f^rtiom >iwt otk'i ^tiieralts le^ sbout 

1513 .smauo ^eM- cm he ^[jcairted by vys-theis*. wa&s using technique* well ksowti Jo 
those of ordinary skill in Use art .fo one illustrative example, such polypeptides are 
s?nthf\iKWd tts^ any ot the cornmercutUy a\aiiafele solid^phasc fe?c?snique^, ^iv'h as the 
Memftekl st^kl-pbose syailje^is mefhos;!, sm'mo acids are seqoentbUy aikJed to a 

25 grimrcs* amiiK* m*} dkim, oSVe M*rrtf«ddi, J Am (,h<-m Sac ,?14^2l4f^ 

Lqiufmtut t< : joioretiteJ s^nthn'^b ot polv\>tjptktei» is cofttKien.'i^Hv a-v^^hNe irom 
supplim such 33 Pcrkin Elt^er/'Appiieil BioSyatem 1 ; Division (Poster City. CA}< imd 
may he opt?rate<( acc^diaa to the mamifacture^s ms«ructions. 

h\ ytsoery, piilypeptidt; compositions ^BClyding fusioa polypeptides^ of 

3& she tirtciuion .«* kuJ.s^d An "i^Jmwl" pohpeptufe ^ ntnr \K& \$ Kmo\*d faw tt$ 
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anginal environment For c^a-sapk, 53 nitumliy-*x erring pmteb or polypeptide as 
uoL^&cl it it ts separated fsvro ^>m« or .dl of the <o*\i:$ittg material^ in the naturaJ 
sy^ian. Pre reran!}, such polypefttadcs -are also pu^fied, ±\# , are .it about * J $% 
pure, ffiojt: prek^iibly at fcait shout '$5^ psine and mo-t ptel'trably ai l&kzl ahour ^9% 
S pure. 

The pr^ent bventittfj, m other SLvpecte, provides ps ^> nucKnixk 
eoraposUkm,* fbe temv ■"PNA" s^d "^olytiLideoiyi;" arc u.v,d es^orsslly 
lateichangeabry hejeki to *etc? to 4 DMA molecule thai luis bom isnb&d Jk-e c>£ im,\l 

10 jitnomk DNA ot « oameukr specks. 'isolated/ 1 33 sscd her^n* mea&s that a 
pd> riueteotide ^ s>«l>stantjaHy awivy from other coding sciucno&b, anJ thsrt the D&A 
m^ee&ik doc- not ct?niaar* Lrs^e pmtions of untested coding 1>H\. such as l&ge 
oruvraosoiikd tubmen* * vt ■ ther fuih.is*im2 j^i^ <*■ p^hoq-taJe L^lsn** n.^toti c Of 
course, shn rcretb to tlr 3>NA rooka^e ongmaJl> tsobtoi and decs not exclude 

1 S or cvsdm& rt gss ias later add*?d to the .isj^meot bv ther twid of m-Ao 

\s will be Maidesstood by those skilled; m the Ml. iht; potynuck\&jde 
4 empoisons of this mv^utjan caa meki<k genomic so^wracss. exir.^gerromie £ud 
p3^mid<^«k\1 tusquenccvi oij-i smaller e^m-eersd gene ssfgmeob that express, or majf 
be adapted to e\f neas, p-rtstesris, polypeptides, peptides ^nJ the like. Sueh segments may 

20, be tvatursUy isolated, of modhied syMhc-ucaMy by ttie h,iKd of man. 

As mil be ijjo recognized b\ the skiiicd artist, polyt^dcoddu.i of ch^ 
mven&on may be s%5&le-^tvd6d (ending nr assh&en&e> or vloybk-jtraisdod, a»d isay be 
DKA (g*no»ik, cBNA or syntnetk) or KNA cnoleadcs RNA molecule"! may irtfch&fe 
HfiKHA i«olti^uics whteh e^^L»o iutron* isnd currap..md to a 0NA ax^eoyk m s one- 

or ntJR-eodiag sequences, issa^ but ac«d eoi, he prcscni v^uhin ft po^nucfe-ottde of the 
present invention, and |x>tynuclet^dde m.^ , bul need riot, be linked to othct molut sites 
&rKl/ftr support m^ftfialH., 
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Polynucleotides may comprise a n*f tvc sequence (i e . <m endogesvias 
*?e<3usaec that encodes a p*4ypepnde*proteir£ of the m^e&tioa or a portion thereof* or 
may comprise a s^qusmje ^hat encodes 3 wt&jit or denv,i*ive, preferably an 
imtsun^gwiic variant or dctivalivb, of such a sequence. 
5 Therefore, according to awthur fy^pect of ihe present Invention, 

polymic&otidc compositions are provided thai comprise some or all of & polyauekoiide 
sequeoce? set forth in any one of SKQ il> HQs: I A U, 1 15*171, 173*175, IT 7 , V79*-5i)5, 
?07-3l5, 326, 328, UO, 332-335, 3*10*375. JB1, 3$ 2 *irsd 3B4-476, 524, 526, 530, 53K 
533, m 5>6. .752, 569-572, 587, 591, 5^-6S,*, 618*426, 430, 6H, 634, 636, 63<J-£*5;> 

10 674, 680, 681, 7J I, 713, ?I6, 720-722, 7)5, 737*73$, 751, 753, ?44 s 765, ??3*>??6 aad 
784 7??, vnroplcn«n&. p^3> *ufde»nde j^wno* .set forth m oae ai SR3 ID 
NOs: 1-4H, US-rn, I73-IV5, i77 : )79-305, .307- 1 IS, 336, 32K, 33U, 332-335, 3-10^ 
375, 3S2 and 3R4-476, 534, 526, 530, 531, 53.*, 535, 534, 552, 569*572, 587, 5^1, 
S93-tif)t> ( 618-42$, 630, 631.^4,63$, 439-655, 674 t 4S0 t 681, 7U,?U, 714, 720*722, 

15 735. 737-7 ,W { 751, 753, 164. 765, 773-776 a*d 7&6*?S8, ansi dtgenentt varies of a 
pohauoWtidt^t^uetv*- \et Ruth »t w> oae of 5>fQ IP NO;* MIL J15-P; 1 73-3 7S, 
177, 17*>-3G5, 307*315. 324, ^8, 330, 3*2035, 340-375, 381, 3S2 aaa 3*4-476, 524. 
526, 530, 33 1 , 633, 5 )5, 534, 552, 56*>~572, 58?, VH , 593*606, 61K-624, 6J0 ( 6? 1 , 634. 
434 639-655, 474, 680, 681, 71 U 713, 716, 720-722, 735, 737*739, 751, 753, 744, 745, 

20 773-774 and * J &6*-72?i In certain preferred embodiments, the polynucleotide sequences 
,vet forth herein encode. iirsErmnogentc p*olyp*2pfides> as described above- 

In other related ernbodbiei^s, the present invention provides 
polynucleotide variants having substantial identify ts% the sssqycxsces disclosed herein b 
SEQ ID NOs: MIL 115471, 173-175, 1??, 179-305, 347-315, 326, 32S, 330, 332- 

25 135, 340-375, 381 382 and 384-476, 524, 524, S30, 531, 533, 535, 536, 552, 569*572. 
587, 591, 5^3 -604, 6 IS-626, 630. 631, 434, 436, 639-455, 674, 480, 681, 71 i . 71 3, 716, 
720-722, 735, 737-739, 751. ?53, ^64, 765, 773-776 and 784-788. for example th«Mr 
comprising at least 70% faience fcfcnthy, preferably a? least Ti%, 80%, 85%, 
<?5%, W4\ 97%, o? W-J or hiprei, ve^tKisxe idtrwtity o«mp,*srcd to a 

30 pi-(l>nuclv»^i,ie scmit'in^ of this w^eow-a us?*o« tbe mrthodij dcscnbwl Iwrein, (t^>,. 
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BLAST Mlysis Mag tMmkrd p®mm&m> m &mahs& Mow), to sWUed la this art 
will rv^ogoke that these values ta» be appropriately adjusted ?o determine 
costcspisftcliBg js.ksn.3ty of proteins encoded by fwo nudeotuk sequences by taking ir&o 
acmiiat eodoa degeneracy, amino acid sbiilarUy, reading frame poRsrionmg and the 
5 life. 

iypealty, polysacieotide variants wiif contain oiw or hjctc syto^itytiGns, 
addidom, deletions and/or insertions, preferably such tbsst the iraisjuiwgejikily of the 
polypeptide errcrwkd by Uae u^rijot polynucleotide <;uh;^i>nii,sHy -limmtsheJ 

rcMive to s polypeptide; eroded by a |x»3ynu<:l«otsiJe sequence specially set forth 

10 r>sr*;m). Hie te*m "variars^* i&ouid also be underbid to ettcom|»aK&e« hooujlogtuis 
(Scenes -of Jteaogeaic eiirigra* 

la additional embodiments, she present inversion provides 
pnljmKkMtiJo (rug^Kut's vorc-ps^mg, xartou^ W^ta". vf contt^ynm etteUii^ of 
^quc'tite loVnni M io, or o^rifU-menfoirv ai, one ur mort *>t die Net^&n-ees disclosed 

I S herein. For e&aruple, polytiy»:leotjde& are provided b> tikis invention that comprise at 
kast about 10, 15* 20. 30, <R 50, 75, 10*J« 150, 200, 300, 400, 500 or 1000 or more 
contiguous ntKieotjttej! of one or more of Oie se*|MeBi-'e$ dihdosed hernia as well as all 
intermediate lengths, shere bit ween, it will be readily understood that "kflermediGte 
lengths*, m Oils context, means my length between the quoted values, such a 1 : 16, VK 

20 19. «r ; 21 Z2> 23, 50, 31, 32, 50, SI, 53, 53, ate ; 100, W, 102, 103, 
150, 151, i 52, 153, «t i icsludwg all insegets through 200*500; 500- 1,000, an*! the 

like. 

In another embodirMfit of the invention, polynucleotide compositions are 
provided that arc capable of hy bndiaRg under moderate to high stiine-en^y comlirtons to 

25 a polynucleotide ^utoee provided heroic of a fragment (hereof, or * complementary 
sequence tberc^i Hybndt^tcirt teekoktoex mre w*-ll krtiiwn in the atf of mok-dar 
biology, for purposes of illustration, suitable moderately stringent condltiorss tor 
testing the hybridation of a folyriuelefjtilje of this iavetmon with other polynucleotides; 
include orev^hmg ^ a ^letum ot S X 0 5% ^1>S S 1 0 LOIA lH *-0>; 

39 hybridizittg at WC-mW 5 X SSC, overnight; followed by vrashiag twice «l 65°C for 
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20 tmirote* ^i<h eacb of 2X, 0.5X *&£ 0.2X SSC oomakiiisg 0.1% SDS, One ski'led kt 
the mi wtJI uaskretand that the strmge.jvey «if tybridization can be r&iiiily manipulated, 
sw-h as by atiering the salt coated oi the hybrktestioB sokmon asd'or the temperature 
iu which the hybridiV^iun re performed. Far esam.pk\ m mtshcx sraboditaeert, suitable 
5 highly strhvge&t hybdcfeattoa coiKtrions mchide thase ds-^ribrd above, with Lhc 
exiTpii'tn that the temperacunj of hybridization b iaiejrea&si, eg, Eo 60-*>5 f: C o o5- 
W€. 

hi certain preferred errWditiserits. the poiyiftfckotsde^ described <jl>o«e, 
e & r p^rjnudwtidc variant*. frugtaeuBs and hyhridhvm^ seduces, eiio^is pyiypqKfftws 

forth herei.i. In oShcr prc.&mrd embodtmtaitx s«cb polyriuckiotidis etK^-de 
polypeptides ihai have a It^eJ of imtn'tRogs'rik activity of at te&fi? shoyt SO'Hs,. preferably 
at k.rt „ibuut "0V t»*J nwte nftterhhlv »rt feast alv^a o-f that tot » pHyfOTt^le 
v&qaeisee sjwdftoalh; set forth hereto 

15 Hie pob/nt&dec^des »i present invent^, or inigme-nta thereof, 

regayd?ts*$ «f One length of the outfits ^q^f n>.eii, maj -^mxbmtd w*uh ottm DNA 
^quoncos, ^seh as p?o?ut^s, r^lyadssi>I.Hs*>n Mgi&iK ad*M^u»sl rsjatnetsort m^n-m? 
$jte,\ iTHiltiple eJtn*ing .vt?e?>, other coding segments, asd she like, such tJut their overall 
length may vaiy eoiisder&biy. It is therefore eoMemphted (hat a nut; tew acid fiagme&t 

20 of afsrfmst any length may be employed with *he total length preftrahfy being; I waited fcy 
the ess? of fstrpaiatiun and mso m the bteMsd recombinant DKA protocol For 
example, dlustraave porynudttuluJe segments with total length* of slj jut 1CM300, abtut 
5000, about 3000, aboat 2,000. about 1,000. about 500. about 200, «tout 1 00, about 50 
base pair* m length, the liks, $ including dl ktenrcediate lengths) are comempfetei 

25 t«. be useful in ujany impJeftwrRtations of this hr^entaW 

When ccjapaHng |x>f>'n«ckoiide seqo^tiee^ rvvo gtequOTS^s sre ssdd So be 
i 1dea^^r ^ ^j , ' if (he se^uemx of'sudeDSidcs m the two sequences i& the Muse whsn aligned 
for maximum corr^mdence. iss dessedbed below. Comparison betw^cti two 
s^H^ik^cs txee typtcfl% jieffi,*rracd hy ctHppayntns Om Bcqiwrwes over a oo*npartsoe 
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witklow" as, used herein, refers & segment of a( kast about 29 contiguous posUions, 
usually 30 to about 75, preferably 40 f» shout 50, m which a se^iitsisas may be c^mpare-d 
to a referemx sequeiac-B of the same timber of osH$%ti<>us posMisirvi after the two 
sequences arc optimally aligrved. 
5 Optimal alignment of sequences for comparison may be conducted using 

the Megahgt* program ?n the iMcs-geae suite of bfomibtni mcs so&waie (DMAS JAR, 
foo., Madi^ors* Wl), using default parameters This program embodies several 
aligmwit ^heroes. d^erihcxl in the following re&rens-c-s: UftyhftfT, MSh (1978) A 
rtsodei of evolutionary change in ptou^rw -- Matrices for tlctc-eting <|isiy^t relationships. 

10 In D^lioU, M O (tfsii) Atk^ *>1 1'jotuts St^uefle^. ^Jrxvaure NWiorut ^otne-b^d 
Research Fftiffiriaiion, Wstshmgton IX: Vol 5, Suppl 3. j*p. 345-35 R; Hcin J. \V)9%\} 
t;ssSftt;d Appfijsicrh to Alignment and PhyEogeRCS pp. ^26^6^5 Afrthoiis in Ewsymokigy 
vol. 1JH, Acadeimc P**$s; bar., S«t DJcsro CA; Biggie D,G, 8twi Sharp, P M. 
C.4#/ttV5-.iSM33: Myers, l-.W- and Mailer W. (1988) CABfOS 4:ti47; RoNa&on, 

1.5 Fr.D, (WUComb Thmr //: 105; Santou. N. N«k M (1987) AM Afctf AW. #:406~ 
4."?^ Hn$Mt.h. P.1-1A. <md Sok^l, R,K. {I9 7 3) Numerical T&xowmy— th<? Frim-ipks and 
Practice of Numerical Taximom-, Freeman Press, San Francisco, CA; Wllbw, W J. tsnd 
LipjruMj, O J (1983) froc A<wJ jfcwt. Sri. l/Srf £ih?2oV?3a 

Altrrrmtsv'dy, optimal sJi&isncat of sequences, for comparison may be 

20 cfciM&icted by the local identity &!g»<«i$hm of Smith <#)d Wakmssa {i?St)/Jrf^ AFL 
Math by the ideality alignment algorithm of N*<kiis;(»a« md Wtirasuh {I $70) J 

Mei FmL 4X by the ^e^tvh lu* Mmilmty methods ^f Peitrso-fi ^id Lspmsm ij^&l 
/^oc iWf/ AcoJ Set VSi SS. 2444, by oompvatenzed implementations of the&? 
algorithms (CtA!\ BESTFIT.BiAST, FASf/A, and TtAbTA m the XVi^xssmCtenstics 

iS So&vflosre Paekugt\ Genedu Computer Orosip tQCO), 57? Sckn^ Dr., Madi^OEi, Wt), 

t.>ne prtfVrrod ex^plt ^t d^^ntenj, that ate ^utublc fbi dttoKtunsn^ 
peTOCnJ iequoace idcjtsdty und jajqueace iiRularky aK the BLA^T aad BLA^I 2 0 
algorithms, wh?ch aiv desarib^ m Altschui ef al. {JV??t W/ ^cr 25:3 3 8^3402 
30 AHschol et d (1990) A*?/ firo/. l?t 5 10, restively, Bt AST iffid Bt.A^ F 



56 

2 0 can be for sample wth the parameters described kw, t* deietm rc ptsrcta t 
frequence lieRtsfr, lor the oo'> ucitMLdes t f the ir\on£toa Suih*.*** fcr jvartocrau^ 
PI AS I as al, -as ts nub'K h rvaiu^ t^rou^h X«ioral Conttt i * BvKxhaologj 
Irtomsfion o k Jl .straws cx^ple k cttrcutat-re xorc> i be eakuUtod ii^ng, *os 
5 ttuckotitte %*<quct cos t! e |\irm rtei> \i irpv.jjd were 1w a par f<* n anhi*ft K*«lue% 
aWn ^(ft arJ N fpena^ ^c*e tor r Mrai<.hi tm res. c^kw ,>1w&s iXrcumm 
l^e? 1 tts> m «f iLwchw a « tultu^ whe:* thv caa ulattve ^rarent sec c faih ott 
by tht quantity X ifmuiMio jick-u^i viluo, ih. c^m^lat^s M-orc goes *ayi 

o«- b^o\\, Jue t,« c a, <. ue raL ;jum t * one t* n» le rK^mve^mx it idw *ijpnH)OHi-\ 

10 w the cimI at othc *^utr cc *s * fwd HI I \lw 'lm wtretas V» , f ?nd A 
d«trPiTiu*c scn,itiviiy md n ttut dhfcnntm I he fi] AMN pj<aju*n 

ii'-Lk«i»*'t*e ^uciiii; J ut*\- A:\Hlt 1 a vwnilct t \V) t r It anJ c .pertain n (t ol 
H „ ft J the Hi ( »M*M6? -cnrnn* imtm tu\- Hen lor* viJ linn koft j Vari 
k <iJ $st i ^ t ^ s j aUgnrrefrts, 1 H) »n e- ^M»t»cn tU ot 1 0 M - * jjkS 

1 5 d urftfs.tm.CKi ot b> <b id- 

^> np&rng tv>a a^tir^r> <m^l\| v.t\ivii'sice^ ^\et <t vs^tlow oi cos tpa iym vt # le^ 

wnsdov* nay t*vmpR , -e *ddjU«ns < r ^Jeiiwii 0 e a^i of A* pesrewji n T k-&, asualh 5 
20 JO * s P"wer t. » r "0 to ,2 percent, as W|Mred to tih« jvrfcrt'n.c se^aefecs ^ubicfa docs 
u>n ^rs-e aodsttcits ot v'eUi oa^s ortjral ^i^ncn? J*ie two aequtn^ 11k 

pevt itafcts ^ ».a'c»lats^ bv eeiem ir^og the ^iTObct ot po mon* »r wluch the- tdet*n«J 

ducKk d biex ou^n ,n K^ih se^^encc n> \»old the vw tber of t vj&ht.i possts^t^ 

d^^Ung tbc luibct )U^hc-d po-sts^m 1< v the u-tat ntiibe of |xs*ts^ the 
2S ic^^rce ^quewo \i e *hc iMndw sire) and ^uXtrT^ng Uie Tcs«l!ts b'« hH» t» v^kl the 

pevetf^t; ot seqtjeiao- tit; ri> 

It vkUJ S. opprc:*aie<! b> tl *HvLmun skill the in tiwt* a. a 

*>f tl c Jego icrittv of gerdK code t^en arc r^ans t^cl^tioe sequs^cc^ that cn.udt? 

a puKfsptule w yh>wUx$ her^rs. tit ^i,>jJi^t^otnJcs hf-ar ru^nui* 
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that ^ *ry djo ft d>Jt.et^ t (V u^m v ate tp^t. a J»v » o ttcmpj.dUM V th^ p^tni 
irwittivx 1 1 *ihci aLtles liu t^nt* Lomsnwy, tbc V^mAluittde ^ jets.* „ 
Hs^hkd borem ^ c within the ^ ^ i i iio jm. -erE nwitsoti AJcks arc f uJogcjwas 
gents \r altered w i rewU t*f f\e or motx et * stums s k's in deVt or^ ^dmons 
( . ^ ik^tutw-u >l nut k^u k - IK- ie^ttw# <t KNA vsid psoVm tim , Htt xvd m<\ 
Imt. an aJtu-t* ^f\jO,in or kmUtoji Altk\ bo d**\tfitd ^<d N rd 
h l!« Kiaes j MM.b * Sis *>m\ if or\ arrpht^ Mms m>U r d ^v<t v atquenc*. et< nr^t on) 

tfetut*^« m Pktlhti iM^KwKrenf it motion a ij^ i p i„ 
x p^» t ,<rrh t^ucK ^ "tie specie n^i*^eiK«3Js is oved r^r Jv *\cpr*utrr or 

h) 'nitvu <i ^rk v*n»Pk id'or dv^s ^f, ^ of the pol> ^epsbdeb V^rbeo Hiem Ss this 
>*SH*road, specif*. "ooiitU«tfiors >i polypeptide soqwemx can made through 
ir-ats^os-is the iisidcrJ>i% |v\a«de>oty«& that encode th^ri. These techniques 
prs<s ac: a ^taughtt^^-yd appm^b n ptcjx^e at*d it t sequence vsna«5t*, fos cxjnpie, 
s^c ^rponting. ore or m<> x c of }K ft-regomg considerations* by imroduang ot^e or more 

15 tiucleotide sequent-* changes i^to the poljMidentide, 

Sue-speeitle mutagenesis, allows* the production of mutant* through the. 
«$e of specific oligonucleotide sequences s^hseh encode the DNA «€qiicnc« of lb® 
ctesbsd 'Tiutatlott, a* w^ll & syffkient jiumber of adjacent Duc!eat3d«$, to provide a 
pmr>c*r sVequeece of ■sufficient sue aj^d $equen« complexity to foms ^ «tal>k duplex on 

20 bstii sM-es of the deiteriaa janc^on besag trnvtrsed. Mutalioas K^y be employed b a 
selected poJymtckotkie soivctcc tn improve, slier, d-cereasc, jBodify, or otherwise 
cbar^e the propertk^ of the polynucleotide ife^lf, asd'or «ifer the properties, tcti^sty 
oompoaJtiott, stabtlitv", or primary sequenciS of tlie WK^ded polypeptiiie. 

Ijj ceartajB emkxisn^wts of the present iBventifia» fee juveiitors 

25 coatcmplsfe the snunjgenesis of the disclosed jaoly nucleotide sequences to altet otic or 
more properties of ^he eoeoded polypeptide* sueh tbe Knmunogenieify of a 
polypeptide vssctine. The tsschniciues of site-specifie mufageRC^is are well-knovvii in the 
^ mi aftf widely u&ed to create variants of both polypeptides polyauekoticfes. For 
e<amplc t sjle-spft.sfic mutsigenesits is ofte,R used to Alter pi spm?k ponim* of a DNA 

$Q jwjjucttle. In such eoibodixneo^ & prunef compftsijig typ?c-afiy about 14 to ^hout 25 
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uucieuiifks iv r so in k^gth h rasployect wuh about 5 to ahmit 10 residues en both sides 
of sbtjyncum* of the. sk^ugjk* being jittered. 

,V> will be appreciated by those of Ak.il] in the m% site-sped fie 
nujifigt-uests terhmf|it!»s h&ve oiteit employed s phage vuetor that exists its both 3 single 
3 branded atxi double strandoi form, Tvp*e<d vectors vavM m site-dbetttecl nsytf^fmsis 
inehide vectors- siith as tiae M13 pha^e. Jhese ^hugc are fsadiiy eommc^fU^y-av^UibJc 
SKtl their use is generally well-known to those skilled in the- £fi. Ttouhle-; traded 
|sLtMT«kh a?e also j«>«Unc!v t*mp|r,>£d m <;ke iJifvc^d taiit agent's:.* ebmn.jU\» the 
step of irMsferdng the gene id" intereM torn a plasmid to a phage. 

if 1b general ttte-dsmJetJ mua§e r &rs« >r aecorJoFce herewith is 

performed by fust obtaining a aingle^strarxied sector or roehing apart of two strands of a 
dt^He-Gtraraksl vector xh&t includes v.jthiii rts ^eque«L->* a 1>NA ^equenu 1 thai enanks 
the desired peptide. An oagonuotcoiide primer bcaruig the dessred metaled se^enee is 
prepared, generally synthetically. This prims?- then annealed with the ssngfe-stroreded 

15 vector, sad subjected ie- DMA polyai*rixing enzymes sudi as F, coli polymerase I 
Ktenow fragment, in order to complete the synthesis or the lautatioEs-be&r-iGg strand. 
Thus, a heterodijples is fornved wherein one strand encodes tin? original non-mutated 
Sequence and the second strand bears the desired mutation, Thk< heterodupies vector h 
then used to transform appropriate cells, 3uch as B. coii cells, aitd clones a?e selected 

20 which include reeomHnam vectors bearing tfe mutated sequence anangecraaL 

Ihe preparation of sequence variants of the selected peptide-encodmg 
DNA segments using ssie-directed mutagenesis provides a means of ppxifeemy 
potentially usefb* species and is aot meant to be. limiting as there are -other ways i?i 
which sequence v&riamts of peptides and the ON A ^quences encoding them may be 

35 ohtim^d ¥ot example, reenttjhitiaftt vector encoding the dented peptide .sequence 
mssy be tres^d with mytsgenic agaus, sych as feydiViKylamhie, to obtain r^naice 
variants. Specitk details regarding these method 1 ; ami protocols are found in she 
te»thiiigs of Maioy e>tU t }9 ( H; Segal 197**; Proltap and Bajpai, 1991: Kuby. 19^4; and 
Manias &? ai : \9&2, each itscorpt^afijd h«r^in by r«fe^nce, for that purpose. 



